Method For Detecting And Purifying Pancreatic Beta Cells

ABSTRACT

The invention is based, in part, on the discovery that a polypeptide, referred to herein as Betacam, is selectively expressed on the surface of pancreatic islet cells. Thus, in one aspect, the invention is directed to compositions comprising Betacam or that can be used to detect Betacam. In another aspect, the invention provides methods of detecting (e.g., non-invasively) pancreatic beta cells from a mammalian cell source. Another aspect of the invention is directed to cellular purification of pancreatic beta cells from a heterogeneous cell source of multiple kinds. In another aspect, the invention provides methods of identifying agents that modulate activity of Betacam. In yet another aspect, the invention provides for improved treatment and diagnosis of diabetes.

RELATED APPLICATION(S)

This application is a continuation of U.S. application Ser. No. 13/128,181 filed on Nov. 5, 2009 which is the U.S. National Stage of International Application No. PCT/US2009/063417, filed Nov. 5, 2009, which designates the U.S., published in English, and claims the benefit of U.S. Provisional Application No. 61/198,763, filed Nov. 7, 2008. The entire teachings of the above applications are incorporated herein by reference.

GOVERNMENT SUPPORT

This invention was made with government support under Grant Numbers P30 DK057516 and U19 DK061248 awarded by the National Institutes of Health. The government has certain rights in the invention.

INCORPORATION BY REFERENCE OF MATERIAL ON COMPACT DISK

This application incorporates by reference the Sequence Listing contained in the following ASCII text file being submitted concurrently herewith:

File name: 37861039003SEQLIST.txt; created Dec. 22, 2014, 133 KB in size.

BACKGROUND OF THE INVENTION

Diabetes is a devastating disease that is caused by either the complete destruction of the pancreatic beta-cell (type I, or juvenile diabetes) or the deterioration of the function of such cells (Type II, or adult diabetes). Many people suffer from the diseases, with an estimated number in this country of app. 17.5 million diagnosed. As a percentage of the total population, this figure is rising. Associated health costs as outlined by the American Diabetes Association are estimated at $174 billion, of which one third is accredited to a loss of national productivity. The average cost for health care expenses for a diabetic person is 2.3 fold higher than in absence of diabetes, and is currently set at $11,744/year.

Temporal curing of diabetes has been achieved. Presently, a sparse supply of organ donor cadaveric human islets can be used to transplant a limited number of type I diabetic patients. Such recipients become insulin-independent, for periods now up to several years.

The etiology of the disease is related to the role of the pancreatic insulin-producing cell, the beta cell. The normal function of the pancreatic beta cell is to control blood glucose homeostasis, and in absence of such regulation several detrimental effects are observed in patients with the disease, even in the presence of intensive treatment. Such long-term complications affect the function of the kidneys (nephropathy), eye degeneration (retinopathy), loss of extremities by amputation (vascular complications) and diabetes is furthermore associated with increased cardiovascular risk, and results in a shortened lifespan.

In both type I and type II diabetes, focus is on the life and function of the pancreatic beta cell. This cell type is unique in multiple aspects, the most important being that it is the only cell type in the body capable of producing insulin. Consequently, a loss of such cells leads to insulin dependence. Type I or type II diabetes is diagnosed at a point where the function of such cells have decreased to a level not meeting initial demand for appropriate blood glucose lowering following a meal. It is generally believed, however, that if one could assess beta cell mass, and function prior to diagnosis, intervention strategies may be applied to circumvent the further demise of the failing cell population. For type I diabetes, current focus is on identifying the presence of circulating anti-islet auto-antigens, as such may help identify those children that are at-risk, or are overtly pre-diabetic due to an ongoing immune destruction. For type II diabetes, current focus is on establishing clinical testing, such as the use of oral glucose tolerance testing (OGTT), now suggested as a standard evaluation of males approaching 50 years of age. The result of an OGTT can help identify individuals that are at the pre-diabetic point, and interventions can be performed, mostly including counseling related to the benefits of lifestyle changes involving increased exercise, caloric intake, and balancing diets. In both cases, non-invasive imaging of the beta cell mass, if aided by reagents capable of marking the cell population, could substantially improve the diagnostic toolbox.

Presently, there is no method by which beta cell mass can be assessed in a non-biased manner non-invasively in human subjects. Accurate measurement of beta cell mass in pancreas is dependent on biopsy analysis followed by histological assessment of beta cell numbers and morphometric counting; in most cases obtained post-mortem through autopsy material. The amount of donor material reflecting on progressive disease development is therefore significantly limited, and kinetic studies on disease progression in an individual are impossible using this technology. Another important aspect related to a growing need for beta cell mass assessment is following islet transplantation. Although only performed in few individuals, this technology is becoming more widespread. It is carried out by isolating an islet-enriched cellular fraction from a human donor post-mortem, which is subsequently transplanted through portal vein injection into a HLA-matched type I diabetic recipient. Following transplantation, there is generally no means to assess the viability and health of the grafted islet cells, as these are inaccessible in the recipient's liver vascular system. A general assessment of graft function is determined by measurements of insulin-dependency, gradually lowering injected insulin as cells in the graft become capable of providing insulin. Often, multiple transplants are required, empirically defined based on outcome. There is no unbiased assessment of the actual viable islet cell mass that engrafts within the liver, and it cannot be followed. It should be mentioned that given the local production of insulin by such grafted islets, a local adipogenic effect occurs within the liver, and such changes may be measured non-invasively by MRI. However, in the best case, this only provides a read out of grafting efficiency; it is not able to accurately measure numbers/mass of viable grafted islet cells.

Thus, a need exists for methods of assessing beta cell mass and/or activity non-invasively.

SUMMARY OF THE INVENTION

The invention is based, in part, on the discovery that a polypeptide, referred to herein as Betacam, is selectively expressed on the surface of pancreatic islet cells. Thus, in one aspect, the invention is directed to compositions comprising Betacam or that can be used to detect Betacam. In another aspect, the invention provides methods of detecting (e.g., non-invasively) pancreatic beta cells from a mammalian cell source. Another aspect of the invention is directed to cellular purification of pancreatic beta cells from a heterogeneous cell source of multiple kinds. In another aspect, the invention provides methods of identifying agents that modulate activity of Betacam. In yet another aspect, the invention provides for improved treatment and diagnosis of diabetes.

Accordingly, in particular aspects, the invention is directed to an isolated nucleic acid that encodes an amino acid sequence of Betcam wherein the amino acid sequence comprises, consists essentially of, or consists of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO:2 or amino acids 30 through 463 of SEQ ID NO: 3. In other aspects the invention is directed to an isolated nucleic acid that encodes an amino acid sequence of an extracellular domain of Betacam, wherein the amino acid sequence comprises, consists essentially of, or consists of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15. In a particular aspect, the invention is directed to an isolated nucleic acid comprising, consisting essentially of, or consisting of SEQ ID NO: 26.

In another aspect, the invention is directed to an isolated polypeptide that comprises, consists essentially of, or consists of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO:2 or amino acids 30 through 463 of SEQ ID NO: 3. In another aspect, the invention is directed to an isolated polypeptide that comprises, consists essentially of, or consists of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15.

Also included in the invention are antibodies that have binding specificity for the Betacam polypeptide.

Another aspect of the invention is directed to a method of detecting beta cells in a mixture of pancreatic cells comprising detecting the presence of a polypeptide on the surface of the cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, and detection of expression of the polypeptide on the surface of the cells indicates that the cells are pancreatic beta cells. The method can further comprise isolating the pancreatic beta cells from the mixture of cells.

Another aspect of the invention is directed to a method of detecting pancreatic beta cells in an individual in need thereof, comprising administering to the individual an agent that detects the presence of a polypeptide on the surface of the pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15. This method can be used, for example, to determine whether an individual is at risk of developing diabetes, or to assess the beta cells of an individual that has diabetes (e.g., to determine the appropriate treatment needed for a diabetic patient or to assess the efficacy of a diabetic patient's existing treatment). In one embodiment, the individual has Type I diabetes, and in another embodiment, the individual has Type II diabetes. In yet another embodiment, the individual has had an islet cell transplantation.

In another aspect, the invention is directed to a method of isolating pancreatic beta cells from a mixture of pancreatic cells comprising contacting the mixture with a reagent that specifically binds to a polypeptide present on the surface of pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, thereby producing a combination. The combination is maintained under conditions in which the reagent binds to the polypeptide present on the surface of the pancreatic beta cells, thereby producing a complex of pancreatic beta cells bound to the reagent; and the complex is separated from the combination, thereby isolating pancreatic beta cells from the mixture of pancreatic cells.

In yet another aspect, the invention is a method of identifying an agent that modulates (e.g., inhibits; enhances) the biological activity of betacam comprising contacting a composition comprising a polypeptide, wherein the polypeptide has an amino acid sequence comprising SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15 with an agent to be assessed. The biological activity of the polypeptide in the presence of the agent is measured and compared to a suitable control, wherein if the polypeptide modulates the activity of the polypeptide in the presence of the agent compared to the control, then the agent modulates the biological activity of betacam. In a particular embodiment, the composition is one or more pancreatic beta cells.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

FIG. 1 is a copy of the .html output page from NCBI Unigene number Mm.206911, for the mouse gene locus AI987662.

FIG. 2 is the output page for human CCDS locus 253012, listing the exon/intron structure of the human gene, isoform 2.

FIG. 3 shows the nucleotide sequence (SEQ ID NO: 25) and amino acid (450 amino acids) sequence (SEQ ID NO: 2) of human Betacam, isoform 2.

FIG. 4 is a copy of the .html output page from NCBI HomoloGene 18724, for the human locus LOC253012. The listed genes are the closest homologues (i.e. orthologues) in multiple species.

FIG. 5 is the html output page for human LOC253012, isoform 1 (SEQ ID NO: 1).

FIG. 6 is the html output page for human LOC253012, isoform 2 (SEQ ID NO: 2).

FIG. 7 is the output page from ENSEMBL, listing graphically the exon/intron structure of Betacam. It is encoded on the reverse strand of Chr. 7.

FIG. 8 is a tree-view of homologous proteins existing in the same species (homo sapiens). Node distances reflect phylogenetic divergence.

FIG. 9 is an evaluation of the presence of a signal peptide (SEQ ID NO: 28) in mouse Betacam. This is detected with high accuracy between positions 31-32.

FIG. 10 is an evaluation of the presence of a signal peptide (SEQ ID NO: 29) in human Betacam isoform 1. This is detected with high accuracy between positions 31-32.

FIG. 11 is an evaluation of the presence of a signal peptide (SEQ ID NO: 30) in human Betacam isoform 2. This is detected with high accuracy between positions 18-19.

FIG. 12 is an evaluation of the presence of a signal peptide (SEQ ID NO: 31) in monkey Betacam, long form (SEQ ID: 10). No signal peptide is detected.

FIG. 13 is an evaluation of the presence of a signal peptide (SEQ ID NO: 32) in monkey Betacam, short form (SEQ ID: 11). This is detected with high accuracy between positions 25-26.

FIG. 14 is the detection of transmembrane (TM) regions in mouse Betacam. A single TM domain is detected between pos. 352-pos. 373.

FIG. 15 s a graphical view of the presence and number of neural network predicted N-linked glycosylation residues in Betacam.

FIG. 16 is an extraction of GeneChip expression data for mouse Betacam. Expression levels are based on the Affymetrix MOE430v2 platform, similarly normalized (MAS5.0) and similarly scaled (500) expression data. Data are shown for Adult islets, isolated Phogrin-EGFP cells, corresponding to pancreatic beta cells, insulinoma (bTC), glucagonoma (aTC), embryonic pancreas and late stage embryonic pancreas obtained from an endocrine-cell devoid organ: Ngn3 KO. Supportive data based on EST-expression is shown below: ACEView, lists that the human gene, LOC253012 has been sequenced a total of 15 times from purified pancreatic islets.

FIG. 17 is an expression analysis graph of mouse Betacam. This is based on the Novartis SymAtlas datasets (http://symatlas.gnf.org/SymAtlas/) providing a wide-tissue batch expression analysis. The gene is generally absent from most tissues, except for Large, and Small intestine, and pituitary. Islets are not present in this set.

FIG. 18A is derived from Genepaint (www.genepaint.org). This is a freely available data resource based in in situ hybridization. Betacam is expressed at E14.5 in the CNS ventral cerebellum, and hypothalamic region. Outside the brain, expression is detected in both dorsal and ventral pancreas, but not elsewhere. FIG. 18B is a magnified view of FIG. 18A. The speckled expression patterns correspond to developing endocrine cells in the organ. FIG. 18C is an in situ hybridization. Expression is detected in the non-exocrine regions of the E18.5 mouse pancreas. FIG. 18D is a multiplex RT-PCR low cycle validation of Betacam expression using gene-specific primers. The gene is expressed in bTC cells. TBP is included as internal control. dbEST sequencing from NCBI Unigene lists that 18 and 24 instances of Betacam sequencing has been obtained from islets in mouse, and man, respectively.

FIG. 19 is a multiple alignment of Betacam proteins in 5 selected species (cf (SEQ ID NO: 33), mm (SEQ ID NO: 34), m (SEQ ID NO: 35), hs (SEQ ID NO: 36), pt (SEQ ID NO: 37) and consensus (SEQ ID NO: 38)). Nomenclature is assigned on the alignment for signal peptide, D1, D2, D3, TM domain, cytoplasmic domain. Proline 146 links D1-D2, Tyrosine 239 links D2-D3, the terminal end of D3 is highlighted by isoleucine 335. The “insert” sequence in D1 is a 13 amino acid stretch that is not present in NCAM, against which the protein structural fold-comparisons are done in the following.

FIG. 20 is a structural view of Betacam (left), threaded onto the NCAM fold (right), forcefield energy is shown. This was done using the Phyre threading serving, using the 3D coordinates of NCAM. D1, D2, D3 depicted on the figure. Green color denotes low energy (stable), blue to orange to red indicates conflicts. Figure created by the DeepView program.

FIG. 21 is a structural view of Betacam (left), threaded onto the NCAM fold (right), forcefield energy is shown. The Betacam molecule has been energy-minimized in DeepView, by adjusting side chain position, and 2× overall molecule energy minimization. The end result is a molecule in stable conformation. The numbering of the amino acid residues follow the sequential numbering of the cleaved form of Betacam. SEQ ID NO: 14 lists the amino acid sequence of the extracellular sequence of mouse Betacam, SEQ ID NO:15 lists the amino acid sequence of the extracellular sequence of human Betacam.

FIG. 22 is a structural view of Betacam in which non-conserved residues are labeled. A given region in D2, here named the “D2-kink” is the predominant region at which Betacam is not conserved. D1 is highly conserved in evolution. A region of D3 is also containing variation.

FIG. 23 is a structural view of Betacam in which residues highly likely to be glycosylated through N-linked glycosylation are labeled. Such Asparagine-residues are localized throughout the molecule.

FIG. 24 shows a clockwise rotation of molecule shown in FIG. 23. It is apparent that all high-probability N-linked glycosylation sites are localized on one side of the molecule (to the right in this figure). The leftmost view shows a surface charge-distribution of molecule.

FIG. 25 is a surface charge representation of Betacam. Red corresponds to rich electron density, negative charge, whereas blue corresponds to positive charge. The D1-D2 region creates a dipole.

FIG. 26 is a secondary structure assignment of D1. A total of 7 beta-sheets is present in D1, naming as on the figure. With yellow is highlighted the Arg41 and Asn54 residues. A 13 amino-acid insertion exists between these two residues, not modeled in the present comparison to NCAM, which lacks such an amino acid stretch at the corresponding position.

FIG. 27 is a secondary structure assignment of the full Betacam molecule. D1, D2, D3, all contain 7 beta-sheets, and all fold as Ig-type domains.

FIG. 28 is a magnified view of Domain 1, highlighting two residues (Phe26, His27) expected to insert into the acidic groove.

FIG. 29 is a magnified view of Domain 1, highlighting two residues (Leu33 (Leu1 in the molecule after SP cleavage), Lys34 (Lys2 in the molecule after SP cleavage) also expected to insert into the acidic groove. Lys2 is expected to contact glutamic acids 132 and 133.

FIG. 30 is an extraction view of the Betacam alignment sequence (cf (SEQ ID NO: 39), mm (SEQ ID NO: 40), m (SEQ ID NO: 41), hs (SEQ ID NO: 42), pt (SEQ ID NO: 43) and consensus (SEQ ID NO: 44)) in which the docking residues are highlighted.

FIG. 31 is an anti-parallel docking of two Betacam molecules, in which D1 docking residues insert into the acidic groove created by D1′/D2′, and D1′ docking residues insert into the acidic groove D1/D2. A resulting half-domain offset is visible in the interaction region. For both molecules, all N-linked glycosylation residues are facing outward, and will not interfere with the docking. The resulting dimers form a semi-circle, which is viewed (left and right) using 90-degree rotations.

FIG. 32 shows surface renditions of a total of 4 Betacam molecules docked using one D1/D1:D1′/D2′ interaction in the middle, and adding two additional Betacam molecules docking as D3/D3′ interactions. The resulting molecule will take the form of a spiral, in which the central region of the extended circle is pinched together, creating a figure-8 shape. The top and bottom of the figure-8 would be attached to the membrane of two interacting cells.

FIG. 33 shows surface renditions of a total of 4 Betacam molecules docked using one D1/D1:D1′/D2′ interaction in the middle, and adding two additional Betacam molecules docking as D3/D3′ interactions. Surface charge overlaid. Only two basic-regions (D1) of the molecule are present, those of the two Betacam molecules engaged in D3/D3′ interactions. Connecting additional Betacam molecules to these using anti-parallel D1/D2 regions would extend the spiral, and neutralize the surface charges.

FIG. 34 shows threading results of Drosophila Lachesin onto the NCAM fold structure.

FIG. 35 is a view of Drosophila Lachesin following energy minimization. The insert depicts Lachesin binding to the membrane using a GPI anchor.

FIG. 36 is a detection result of trans-membrane regions in Lachesin.

FIG. 37 is a magnified view of Drosophila Lachesin, Ig-domain 1. D1-bulge residues are shown (SEQ ID NO: 45, SEQ ID NO: 46).

FIG. 38 is a surface charge representation of Lachesin. The D1-D2 region contains an electric dipole.

FIG. 39 shows prediction of a signal peptide cleavage (SEQ ID NO: 47) in Lachesin. Cleavage is predicted to occur after residue Alanine 25.

FIG. 40 is a schematic description of chimeric molecules generated for the experimental assessment of Betacam function. C-terminal Flag (or His)-tagged full-length murine Betacam (mBetacam) (including signal peptide, D1, D2, D3 domains, transmembrane domain and the intracellular domain) was subcloned into pIRES vectors so that it also expressed green fluorescent protein (EGFP) or red fluorescent protein (DsRed) in a bicistronic mRNA messenger.

FIG. 41 is a schematic description of chimeric molecules generated for the experimental assessment of Betacam function. Generation of mBetacam domain-specific-Fc recombinant proteins. mBetacam D1 domain (aa 33-146), D2 domain (aa 146-239), D3 domain (aa 240-335), D1-D2 domain (aa 33-239), D2-D3 (146-335) were amplified by PCR and subcloned into vector pFUSE-hIgG1-Fc2 to facilitate the construction of Fc-fusion proteins.

FIG. 42 is a graphical presentation of the cloning vector pCMV-SPORT6.

FIG. 43 is a graphical presentation of the cloning vector pcDNA3.1D-V5-His-TOPO.

FIG. 44 is a graphical presentation of the cloning vector pIRES2-EGFP (SEQ ID NO: 48).

FIG. 45 is a graphical presentation of the cloning vector pIRES2-DsRed2 (SEQ ID NO: 48).

FIG. 46 is a graphical presentation of the cloning vector pFUSE-hIgG1-Fc2.

FIG. 47 is a graphical presentation of the cloning vector pGEX-4T3 (SEQ ID NO: 49, SEQ ID NO: 50).

FIG. 48 shows the result of production of GST-Betacam33-80 using bacterial cells. Coomassie-stained protein gel is shown.

FIG. 49 shows testing of rabbit anti-sera raised against GST-Betacam33-80 using Western blotting against GST-fusion proteins.

FIG. 50 shows testing of rabbit anti-sera raised against GST-Betacam33-80 using ELISA.

FIG. 51 shows testing of the “RAT” rabbit anti-sera raised against GST-Betacam33-80 using Western blotting against Fc-Betacam fusion proteins.

FIG. 52 shows immunohistochemistry of anti-Betacam antisera against sections of embryonic (E14.5 (top), E18.5 (middle)) and adult (2M) mouse pancreas (bottom), performed as a double-staining to insulin.

FIG. 53 shows purification of various forms of Fc-fragment fusion proteins to Betacam, during reducing and non-reducing conditions. The slower migration under non-reducing conditions signify the presence of disulfide bridges. Such are observed in Ig-domain1, -domain 2 and -domain 3 of Betacam.

FIG. 54 shows western blotting detection using anti-human Fc-antibodies, the presence of Fc-Betacam fusion proteins in the lysate of transiently transfected CHO cells.

FIG. 55 shows direct, one-step cell surface live labeling of pancreatic insulinoma cells (bTC) using fluorescent beads bound to Fc-Betacam fusion proteins.

FIG. 56 shows cell surface live labeling of pancreatic insulinoma cells (bTC) with Fc-Betacam versions as outlined on the figure. The experimental conditions are shown at the top.

FIG. 57 shows digital images of bead-aggregation of Fc-conjugates fluorescent beads.

FIG. 58 shows flow cytometry analysis of pre- and post-incubation, testing for YG-beads only aggregation. An image is shown below representing the same experiment.

FIG. 59 shows flow cytometry analysis of pre- and post-incubation, testing for Blue-beads only aggregation. An image is shown below representing the same experiment.

FIG. 60 shows flow cytometry analysis of pre- and post-incubation, for YG-beads and Blue-beads only, followed by aggregation. An image is shown below representing the same experiment.

FIG. 61 shows flow cytometry analysis of pre- and post-incubation, mixing for Fc-conjugated YG-beads, followed by aggregation. An image is shown below representing the same experiment.

FIG. 62 shows flow cytometry analysis of pre- and post-incubation, testing for Fc-conjugated Blue-beads aggregation. An image is shown below representing the same experiment.

FIG. 63 shows flow cytometry analysis of pre- and post-incubation, testing for Fc-conjugated Blue-beads and YG-beads aggregation.

FIG. 64 shows flow cytometry analysis of pre- and post-incubation, testing for D1-Fc-conjugated YG-beads aggregation. An image is shown below representing the same experiment.

FIG. 65 shows flow cytometry analysis of pre- and post-incubation, testing for D2-Fc-conjugated Blue-beads aggregation. An image is shown below representing the same experiment.

FIG. 66 shows flow cytometry analysis of pre- and post-incubation, testing for D3-Fc-conjugated Blue-beads aggregation, experiment 1. A significant number of aggregates is observed prior to incubation (P3 prior to incubation is 40.8%).

FIG. 67 shows flow cytometry analysis of pre- and post-incubation, testing for D3-Fc-conjugated Blue-beads aggregation, experiment 2. An image is shown below representing the same experiment.

FIG. 68 shows flow cytometry analysis of pre- and post-incubation, testing for D2/D3-Fc-conjugated Blue-beads aggregation. An image is shown below representing the same experiment.

FIG. 69 shows flow cytometry analysis of pre- and post-incubation, testing for D1/D2-Fc-conjugated YG-beads aggregation. An image is shown below representing the same experiment.

FIG. 70 shows imaging analysis of pre- and post-incubation, testing for heterodimer formation between D1 and D2, D1 and D3, D2 and D3.

FIG. 71 shows imaging analysis of pre- and post-incubation, testing for involvement of the Fc-domain in any interactions between D1, D2 and D3.

FIG. 72 shows analysis of stably transfected CHO cells overexpressing DsRed only (control) and Betacam-DsRed fusion proteins. The lower row of images contain a series of comparative pictures of cells overexpressing DsRed only, Betacam-DsRed, BetacamΔD1-DsRed (lacking Betacam domain D1) and BetacamΔD1/D2-DsRed (lacking Betacam domain D1-D2).

FIG. 73 shows fluorescent image of adjacent Att20-cells expressing Betacam-V5his full length Betacam. Detection of signal was performed by immunofluorescence detection of the V5 epitope. Staining is observed at the membrane attachment zone of the two cells.

FIG. 74 shows confocal fluorescent image of CHO cells, expressing Betacam-V5. Staining is observed at points-of-contact.

FIG. 75 shows western blot analysis for detecting Betacam protein stably expressed in multiple cell lines.

FIG. 76 shows western blot analysis for detecting Betacam protein stably expressed in multiple cell lines.

FIG. 77 shows immunofluorescence analysis of stably expressing Att20 cells, clone 12E, imaged for expression of V5-tagged Betacam fusion protein. Control cells are shown to the right.

FIG. 78 shows confocal immunofluorescence analysis of stably expressing Att20 cells, clone 12E, imaged for expression of V5-tagged Betacam fusion protein.

FIG. 79 shows flow cytometry analysis of CHO cells transiently transfected with Betacam forms, or control plasmid.

FIG. 80 shows cell aggregation assay, using HEK293 cells transiently transfected with EGFP control plasmid (top row) or Betacam-EGFP fusion protein (lower row). Over time, cell aggregates are detectable in Betacam-EGFP transfected cells, but not EGFP control cells.

FIG. 81 shows cell aggregation assay, using HEK293 cells individually transiently transfected with a combination of EGFP and DsRed plasmids (left) or Betacam-EGFP and Betacam-DsRed fusion proteins (Right). Aggregation of Red/Green cells occur in the Betacam fusion protein transfected populations only.

FIG. 82 shows an example of a nucleotide sequence (SEQ ID NO: 26) that encodes an extracellular domain of Betacam.

FIG. 83 shows the amino acid sequence (SEQ ID NO: 27) of PGEX-4T3-Betacam 33-80.

DETAILED DESCRIPTION OF THE INVENTION

The invention is based, in part, on the discovery that a polypeptide, referred to herein as Betacam, is selectively expressed on the surface of pancreatic islet cells. Specifically, as shown herein, expression of Betacam occurred in both normal and obese mouse pancreatic islet cells and was observed throughout the course of mouse pancreatic development. Expression of Betacam in human islets was validated through EST-sequencing, and the Betacam mRNA was detected 15 times through random sequencing of human islet cDNA.

The inventors named the gene locus in mouse, corresponding to Unigene Mm.206911, FIG. 1, Betacam. The encoded protein was henceforth named Betacam. Its NCBI gene ID is 101202. The current name of the gene as used by NCBI is “expressed sequence AI987662” (FIG. 1). The inventors name the gene locus LOC253012 in humans, its GeneID: 253012 (FIG. 2 and FIG. 3), and its corresponding to UniGene Hs.443169, Betacam as well. The inventors name the protein encoded by the human GeneID 253012 Betacam. These Betacam genes are part of a homologous gene group, conserved during evolution, and NCBI refers to this group as HomoloGene 18724 (FIG. 4). This group was recently named HEPACAM2 based on its weak similarity to the HEPACAM orthologous group. Betacam is a conserved gene between vertebrate species to the extent that it represents a single orthologous group. The closest immediate neighbor is represented by Homologene group 17652, HEPACAM, which displays <40% amino acid identity to the Betacam group.

The naming selected is one of convenience; none of the above genes/proteins are named based on previous knowledge of function, or expression in the liver. The arbitrary selection of gene/protein name Betacam is based on information provided within, where the inventors show that the protein is a member of a cell adhesion family group (-cam extension), and selectively expressed in pancreatic beta cells (beta-). It is believed that no other existing and described cell surface marker is known with a similar specificity of expression, as that displayed by the Betacam-encoded protein Betacam.

Betacam in humans exists in two alternative forms, isoform 1 (FIG. 5), and isoform 2 (FIG. 6). The difference between these forms lies in alternative exon-1 usage, explained by differential promoter usage. The predicted sequence of either form of the encoded proteins differs in the amino terminal sequence. Translation of isoform 1 gives a protein of 462 amino acids [SEQ ID NO: 1], translation of isoform 2 gives a protein of 450 amino acids [SEQ ID NO: 2].

Accordingly, particular aspects of the invention are directed to a composition (e.g., a cell adhesion modulating agent; pharmaceutical compositions) that comprises a select amino-acid sequence derived from proteins encoded by the group HomoloGene:18724, as shown in FIG. 2, or other species orthologues to HomoloGene group 18724, which comprise >50% amino acid identity over the extracellular domain, as shown in FIG. 19. In another aspect the composition comprises select amino acids in the extracellular, but not trans-membrane, or intracellular, portion of such proteins, examples of which are shown in SEQ ID NO: 14 and SEQ ID NO: 15. In yet another aspect, the invention is directed to a composition comprising amino acids derived from the D1-D2-D3 region of a Betacam protein, not less than 6 amino acids longs, and no longer than 330 amino acids. In a particular embodiment, the composition comprises an N-terminal acetyl group. Other aspects of the invention are directed to a variety of uses of Betacam.

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry, nucleic acid chemistry, and immunology, which are well known to those skilled in the art. Such techniques are explained fully in the literature, such as, Methods of Enzymology, Vol. 194, Guthrie et al., eds., Cold Spring Harbor Laboratory Press (1990); Molecular Cloning: A Laboratory Manual, second edition (Sambrook et al., 1989) and Molecular Cloning: A Laboratory Manual, third edition (Sambrook and Russel, 2001), (jointly referred to herein as “Sambrook”); Current Protocols in Molecular Biology (F. M. Ausubel et al., eds., 1987, including supplements through 2001); PCR: The Polymerase Chain Reaction, (Mullis et al., eds., 1994); Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York; Harlow and Lane (1999) Using Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (jointly referred to herein as “Harlow and Lane”), Beaucage et al. eds., Current Protocols in Nucleic Acid Chemistry John Wiley & Sons, Inc., New York, 2000).

In accordance with the present invention, an isolated polynucleotide (also referred to as an isolated nucleic acid) is a nucleic acid molecule that has been removed from its natural milieu (e.g., that has been subject to human manipulation), its natural milieu being the genome or chromosome in which the nucleic acid molecule is found in nature. As such, “isolated” does not necessarily reflect the extent to which the nucleic acid molecule has been purified, but indicates that the molecule does not include an entire genome or an entire chromosome in which the nucleic acid molecule is found in nature. The polynucleotides useful in the present invention are typically a portion of a gene (sense or non-sense strand) of the present invention that is suitable for use as a hybridization probe or PCR primer for the identification of a full-length gene (or portion thereof) in a given sample, or that is suitable for encoding a Betacam protein or fragment thereof. An isolated nucleic acid molecule can include a gene or a portion of a gene (e.g., the regulatory region or promoter), for example, to produce a reporter construct or a recombinant protein. An isolated nucleic acid molecule that includes a gene is not a fragment of a chromosome that includes such gene, but rather includes the coding region and regulatory regions associated with the gene, but no additional genes naturally found on the same chromosome. An isolated nucleic acid molecule can also include a specified nucleic acid sequence flanked by (i.e., at the 5′ and/or the 3′ end of the sequence) additional nucleic acids that do not normally flank the specified nucleic acid sequence in nature (i.e., heterologous sequences). Isolated nucleic acid molecule can include DNA, RNA (e.g., mRNA), or derivatives of either DNA or RNA (e.g., cDNA). Although the phrase “nucleic acid molecule” primarily refers to the physical nucleic acid molecule and the phrase “nucleic acid sequence” primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a protein. In one embodiment, an isolated nucleic acid molecule of the present invention is produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis.

Another aspect of the invention are orthologous nucleic acids (e.g., orthologous gene). Orthologous nucleic acids (e.g., genes) are sequences or genes in different organisms that are direct evolutionary counterparts; that is, they are related by descent from a common ancestor. Orthologous nucleic acids normally have the same cellular function. Select members of the orthologous group of Betacam nucleic acids are displayed in FIG. 8. Through the animal kingdom, it is expected to most, if not all, species contain a member of the orthologous nucleic acid group for Betacam. Such nucleic acid members are in this invention considered as being part of the orthologous nucleic acid group of Betacam.

The minimum size of a nucleic acid molecule or polynucleotide of the present invention is a size sufficient to encode a protein having a desired biological activity, such as sufficient to form a probe or oligonucleotide primer that is capable of forming a stable hybrid with the complementary sequence of a nucleic acid molecule encoding the full length (e.g., natural) protein (e.g., under moderate, high or very high stringency conditions) or a biologically active fragment thereof, or to otherwise be used as a target or agent in an assay or in any therapeutic method discussed herein. If the polynucleotide is an oligonucleotide probe or primer, the size of the polynucleotide can be dependent on nucleic acid composition and percent homology or identity between the nucleic acid molecule and a complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration). The minimum size of a polynucleotide that is used as an oligonucleotide probe or primer is at least about 5 nucleotides in length, and preferably ranges from about 5 to about 50 or about 500 nucleotides or greater (1000, 2000, etc.), including any length in between, in whole number increments (i.e., 5, 6, 7, 8, 9, 10, . . . 33, 34, . . . 256, 257, . . . 500 . . . 1000 . . . ), and more preferably from about 10 to about 40 nucleotides, and most preferably from about 15 to about 40 nucleotides in length. In one aspect, the oligonucleotide primer or probe is typically at least about 12 to about 15 nucleotides in length if the nucleic acid molecules are GC-rich and at least about 15 to about 18 bases in length if they are AT-rich. There is no limit, other than a practical limit, on the maximal size of a nucleic acid molecule of the present invention, in that the nucleic acid molecule can include a portion of a protein-encoding sequence or a nucleic acid sequence encoding a full-length protein. According to the present invention, an oligonucleotide probe (or simply, probe) is a nucleic acid molecule which most typically ranges in size from about 8 nucleotides to several hundred nucleotides in length. PCR primers are also nucleic acid sequences, although PCR primers are typically oligonucleotides of fairly short length that are used in polymerase chain reactions. PCR primers and hybridization probes can readily be developed and produced by those of skill in the art, using sequence information from the target sequence. (See, for example, Sambrook et al., supra or “Molecular Biotechnology,” Second Edition, by Glick and Pasternak, ASM Press, Washington D.C., 1998, pp. 555-590). Knowing the nucleic acid sequences of certain nucleic acid molecules of the present invention allows one skilled in the art to, for example, (a) make copies of those nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, full-length coding regions, regulatory control sequences, truncated coding regions). Such nucleic acid molecules can be obtained in a variety of ways including traditional cloning techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR amplification of appropriate libraries or DNA using oligonucleotide primers. Preferred libraries to screen or from which to amplify nucleic acid molecule include mammalian genomic DNA libraries. Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., ibid.

In particular aspects, the invention is directed to an isolated nucleic acid that encodes an amino acid sequence of Betcam wherein the amino acid sequence comprises, consists essentially of, or consists of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO: 2 or amino acids 30 through 463 of SEQ ID NO: 3. In other aspects the invention is directed to an isolated nucleic acid that encodes an amino acid sequence of an extracellular domain of Betacam, wherein the amino acid sequence comprises, consists essentially of, or consists of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15. In a particular aspect, the invention is directed to an isolated nucleic acid comprising, consisting essentially of, or consisting of SEQ ID NO: 26.

Other aspects of the invention are directed to RNA molecules such as antisense and interfering RNA molecules specific for Betacam. As used herein, an anti-sense nucleic acid molecule is defined as an isolated nucleic acid molecule that reduces expression of a protein by hybridizing under high stringency conditions to a gene encoding the protein. Such a nucleic acid molecule is sufficiently similar to the gene encoding the protein that the molecule is capable of hybridizing under high stringency conditions to the coding or complementary strand of the gene or RNA encoding the natural protein. RNA interference (RNAi) is a process whereby double stranded RNA, and in mammalian systems, short interfering RNA (siRNA), is used to inhibit or silence expression of complementary genes. In the target cell, siRNA are unwound and associate with an RNA induced silencing complex (RISC), which is then guided to the mRNA sequences that are complementary to the siRNA, whereby the RISC cleaves the mRNA.

A recombinant nucleic acid molecule is a molecule that can include at least one of any nucleic acid sequence encoding a Betacam protein or other protein described herein. In one embodiment, a recombinant nucleic acid molecule is operatively linked to at least one expression control sequence capable of effectively regulating expression of the nucleic acid molecule(s) in a host cell. Preferably, a recombinant nucleic acid molecule is produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning).

A recombinant nucleic acid molecule includes a recombinant vector, which is any nucleic acid sequence, typically a heterologous sequence, which is operatively linked to the isolated nucleic acid molecule encoding a protein (e.g., Betacam), which is capable of enabling recombinant production of the protein, or which is capable of delivering the nucleic acid molecule into a host cell in vitro, ex vivo or in vivo, according to the present invention. Such a vector can contain nucleic acid sequences that are not naturally found adjacent to the isolated nucleic acid molecules to be inserted into the vector. The vector can be either RNA or DNA, either prokaryotic or eukaryotic, and preferably in the present invention, is a virus or a plasmid. Recombinant vectors can be used in the cloning, sequencing, and/or otherwise manipulating of nucleic acid molecules. Recombinant vectors are preferably used in the expression of nucleic acid molecules, and can also be referred to as expression vectors. Preferred recombinant vectors are capable of being expressed in a transfected host cell, and particularly, in a transfected mammalian host cell in vivo.

In a recombinant molecule of the present invention, nucleic acid molecules are operatively linked to expression vectors containing regulatory sequences such as transcription control sequences, translation control sequences, origins of replication, and other regulatory sequences that are compatible with the host cell and that control the expression of nucleic acid molecules of the present invention. The phrase “operatively linked” refers to linking a nucleic acid molecule to an expression control sequence in a manner such that the molecule is expressed when transfected (i.e., transformed, transduced or transfected) into a host cell. Transcription control sequences are sequences that control the initiation, elongation, and termination of transcription. Particularly important transcription control sequences are those that control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in a host cell according to the present invention. A variety of suitable transcription control sequences are known to those skilled in the art. Particularly preferred transcription control sequences include inducible promoters, cell-specific promoters, tissue specific promoters (e.g., insulin or Pdx1 promoters) and enhancers. Transcription control sequences of the present invention can also include naturally occurring transcription control sequences naturally associated with the protein to be expressed prior to isolation. In one embodiment, a transcription control sequence includes an inducible promoter.

One type of recombinant vector useful in a recombinant nucleic acid molecule of the present invention is a recombinant viral vector. Such a vector includes a recombinant nucleic acid sequence encoding a Betacam protein of the present invention that is packaged in a viral coat that can be expressed in a host cell in an animal or ex vivo after administration. A number of recombinant viral vectors can be used, including, but not limited to, those based on alphaviruses, poxviruses, adenoviruses, herpesviruses, lentiviruses, adeno-associated viruses and retroviruses. Viral vectors suitable for gene delivery are well known in the art and can be selected by the skilled artisan for use in the present invention. A detailed discussion of current viral vectors is provided in “Molecular Biotechnology,” Second Edition, by Glick and Pasternak, ASM Press, Washington D.C., 1998, pp. 555-590, the entirety of which is incorporated herein by reference. Suitable host cells to transfect with a recombinant nucleic acid molecule according to the present invention include any microbial, insect, or animal cell that can be transfected. Host cells can be either untransfected cells or cells that are already transfected with at least one nucleic acid molecule.

According to the present invention, the term “transfection” is used to refer to any method by which an exogenous nucleic acid molecule (i.e., a recombinant nucleic acid molecule) can be inserted into the cell. The term “transformation” can be used interchangeably with the term “transfection” when such term is used to refer to the introduction of nucleic acid molecules into microbial cells, such as bacteria and yeast. In microbial systems, the term “transformation” is used to describe an inherited change due to the acquisition of exogenous nucleic acids by the microorganism and is essentially synonymous with the term “transfection”. However, in animal cells, transformation has acquired a second meaning which can refer to changes in the growth properties of cells in culture after they become cancerous, for example. Therefore, to avoid confusion, the term “transfection” is preferably used with regard to the introduction of exogenous nucleic acids into animal cells, and the term “transfection” will be used herein to generally encompass both transfection of animal cells and transformation of microbial cells, to the extent that the terms pertain to the introduction of exogenous nucleic acids into a cell. Therefore, transfection techniques include, but are not limited to, transformation, electroporation, microinjection, lipofection, adsorption, infection and protoplast fusion.

As used herein, reference to an isolated protein, also referred to herein an an isolated polypeptide, in the present invention, including a Betacam protein, is a protein that has been removed from its natural milieu (i.e., that has been subject to human manipulation), and includes full-length proteins, fusion or chimeric proteins, or any fragment or homologue of such a protein. Such a protein can include, but is not limited to, purified proteins, partially purified proteins, recombinantly produced proteins, synthetically produced proteins, membrane-bound proteins, proteins complexed with lipids, soluble proteins and isolated proteins associated with other proteins. As such, “isolated” does not reflect the extent to which the protein has been purified. Preferably, an isolated protein of the present invention is produced recombinantly. In addition, and again by way of example, a “human Betacam protein” or a protein “derived from” a human Betacam protein refers to a Betacam protein (generally including a homologue of a naturally occurring Betacam protein) from a human (Homo sapiens) or to a Betacam protein that has been otherwise produced from the knowledge of the structure (e.g., sequence) and perhaps the function of a naturally occurring Betacam protein from Homo sapiens. In other words, a human Betacam protein includes any Betacam protein that has substantially similar structure and function of a naturally occurring Betacam protein from Homo sapiens or that is a biologically active (i.e., has biological activity) homologue of a naturally occurring Betacam protein from Homo sapiens as described in detail herein. As such, a Betacam protein can include purified, partially purified, recombinant and synthetic proteins. Another aspect of the invention is directed to modified or mutated Betacam polypeptides. According to the present invention, the terms “modified”, “modification”, “mutated” and “mutation” can be used interchangeably, particularly with regard to the modifications/mutations to the amino acid sequence of protein (or nucleic acid sequences) described herein.

In particular aspects, the invention is directed to an isolated polypeptide that comprises, consists essentially of, or consists of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO: 2 or amino acids 30 through 463 of SEQ ID NO: 3. In another aspect, the invention is directed to an isolated polypeptide that comprises, consists essentially of, or consists of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15.

Fusion proteins and chimeric proteins are also encompassed by the invention. A fusion protein is a protein produced by linking (typically recombinantly, although chemical and other types of linkage are encompassed by the invention) of a protein or peptide of the invention (e.g., Betacam or a variant or fragment thereof) to a fusion partner (fusion segment). Suitable fusion partners for use with the present invention include, but are not limited to, fusion partners that can: enhance a protein's stability; enhance or permit secretion of a protein from the host cell; provide other biological activity; and/or assist purification of a protein from a host cell (e.g., by affinity chromatography or affinity pull-down). A suitable fusion partner can be a protein or domain or fragment thereof of any size that has the desired function (e.g., imparts increased stability, solubility, action or activity; provides other activity; and/or simplifies purification of a protein). Fusion partners can be joined to amino and/or carboxyl termini of the protein of interest (e.g., Betacam), and can be susceptible to cleavage in order to enable straight-forward recovery of the expressed exogenous protein. A chimeric protein is similar to a fusion protein, and the terms may be used interchangeably, except that in the case of the chimeric protein, the fusion partner is most typically a second protein of interest (or a fragment thereof), such as a second protein with a desired biological activity. Accordingly, a chimeric protein may have the activity of each/both of the protein/peptide components, or a new activity resulting from the combination of protein domains.

In one preferred embodiment, proteins (including peptides and homologues) are produced using in vitro translation systems, such as systems based on reticulocyte lysate, wheat germ, yeast and bacteria. The systems preferably correctly post-translationally process the protein, e.g., by proteolysis and/or glycosylation. Products of in vitro translation systems are most typically used in the methods of the invention, although the invention is not limited to such products. As used herein, the term “homologue” or “variant” is used to refer to a protein or peptide which differs from a naturally occurring protein or peptide (i.e., the “prototype” or “wild-type” protein) by minor modifications to the naturally occurring protein or peptide, but which maintains the basic protein and side chain structure of the naturally occurring form. Such changes include, but are not limited to: changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or a few more 30 amino acid side chains; changes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or a few more amino acids, including deletions (e.g., a truncated version of the protein or peptide) insertions and/or substitutions; changes in stereochemistry of one or a few atoms; and/or minor derivatizations, including but not limited to: methylation, glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosylphosphatidyl inositol. A homologue can have enhanced, decreased, or substantially similar properties as compared to the naturally occurring protein or peptide. A homologue can include an agonist of a protein or an antagonist of a protein. Homologues can be the result of natural allelic variation or genetic polymorphism, or any natural mutation. A naturally occurring allelic variant or genetic polymorphism of a nucleic acid encoding a protein is a gene that occurs at essentially the same locus (or loci) in the genome as the gene which encodes such protein, but which, due to natural variations, has a similar but not identical sequence. Allelic variants typically encode proteins having similar activity to that of the protein encoded by the gene to which they are being compared. A single nucleotide polymorphism (SNP) is a DNA sequence variation occurring when a single nucleotide in the genome differs between members of a species, or between paired chromosomes in an individual. Due to variations between human populations, a SNP allele that is common in one geographical or ethnic group may be much rarer in another. In addition, variations in the DNA sequences of humans can affect how humans develop diseases and respond to pathogens, chemicals, drugs, vaccines, and other agents.

One class of allelic variants can encode the same protein but have different nucleic acid sequences due to the degeneracy of the genetic code. Allelic variants can also comprise alterations in the 5′ or 3′ untranslated regions of the gene (e.g., in regulatory control regions). Allelic variants are well known to those skilled in the art.

Homologues can be produced using techniques known in the art for the production of proteins including, but not limited to, direct modifications to the isolated, naturally occurring protein, direct protein synthesis, or modifications to the nucleic acid sequence encoding the protein using, for example, classic or recombinant DNA techniques to effect random or targeted mutagenesis.

According to the present invention, an isolated protein, including a biologically active homologue or fragment thereof, has at least one characteristic of biological activity of activity the wild-type, or naturally occurring reference protein (which can vary depending on whether the homologue or fragment is an agonist or antagonist of the protein, or whether an agonist or antagonist mimetic of the protein is described). In general, the biological activity or biological action of a protein refers to any function(s) exhibited or performed by the protein that is ascribed to the naturally occurring form of the protein as measured or observed in vivo (i.e., in the natural physiological environment of the protein) or in vitro (i.e., under laboratory conditions).

The biological activity of a Betacam protein of the invention includes homotypic cell adhesion between Betacam-expressing cells, including pancreatic beta cells. More particularly, a biological activity of Betacam according to the invention includes the homotypic association of a Betacam protein expressed on one Betacam-expressing cell to another Betacam protein expressed on a neighboring cell. This includes cells of the islet. Such biological activities of Betacam useful in the present invention include the generation and use of molecular components designed to bind to, activate, or inhibit, or otherwise modulate the function of Betacam. Modifications, activities or interactions which result in a decrease in protein expression or a decrease in the activity of the protein (complete or partial), can be referred to as inactivation, down-regulation, inhibition, reduced action, or decreased action or activity of a protein. Similarly, modifications, activities or interactions that result in an increase in protein expression or an increase in the activity of the protein, can be referred to as amplification, overproduction, activation, enhancement, up-regulation or increased action of a protein. The biological activity of a protein according to the invention, and particularly a Betacam protein, can be measured or evaluated using any assay for the biological activity of the protein as known in the art. Such assays can include, but are not limited to, binding assays (including a variety of immunological assays), assays to determine internalization or localization of the protein and/or associated proteins, and/or assays for determining downstream cellular events that result from the activity of the protein.

As used herein, unless otherwise specified, reference to a percent (%) identity refers to an evaluation of homology which is performed using: (1) a BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches and blastn for nucleic acid searches with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S. F., Madden, T. L., Schääffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389-3402, incorporated herein by reference in its entirety); (2) a BLAST 2 alignment (using the parameters described below); (3) and/or PSI30 BLAST with the standard default parameters (Position-Specific Iterated BLAST. It is noted that due to some differences in the standard parameters between BLAST 2.0 Basic BLAST and BLAST 2, two specific sequences might be recognized as having significant homology using the BLAST 2 program, whereas a search performed in BLAST 2.0 Basic BLAST using one of the sequences as the query sequence may not identify the second sequence in the top matches. In addition, PSI-BLAST provides an automated, easy-to-use version of a “profile” search, which is a sensitive way to look for sequence homologues. The program first performs a gapped BLAST database search. The PSI-BLAST program uses the information from any significant alignments returned to construct a position-specific score matrix, which replaces the query sequence for the next round of database searching. Therefore, it is to be understood that percent identity can be determined by using any one of these programs.

Two specific sequences can be aligned to one another using BLAST 2 sequence as described in Tatusova and Madden, (1999), “Blast 2 sequences—a new tool for comparing protein and nucleotide sequences”, FEMS Microbiol Lett. 174:247-250, incorporated herein by reference in its entirety. BLAST 2 sequence alignment is performed in blastp or blastn using the BLAST 2.0 algorithm to perform a Gapped BLAST search (BLAST 2.0) between the two sequences allowing for the introduction of gaps (deletions and insertions) in the resulting alignment. For purposes of clarity herein, a BLAST 2 sequence alignment is performed using, for example, the standard default parameters as follows.

-   -   For blastn, using 0 BLOSUM62 matrix:     -   Reward for match=1     -   Penalty for mismatch=−2     -   Open gap (5) and extension gap (2) penalties     -   gap x_dropoff (50) expect (10) word size (11) filter (on)     -   For blastp, using 0 BLOSUM62 matrix:     -   Open gap (11) and extension gap (1) penalties     -   gap x_dropoff (50) expect (10) word size (3) filter (on).

As used herein, reference to an “agonist” of a given protein refers to any compound that is characterized by the ability to agonize (e.g., stimulate, induce, increase, enhance, or mimic) the biological activity of the naturally occurring protein, and includes any homologue, binding protein (e.g., an antibody), agent that interacts with a protein or receptor bound by the protein, or any suitable product of drug/compound/peptide design or selection which is characterized by its ability to agonize (e.g., stimulate, induce, increase, enhance) the biological activity of the naturally occurring protein in a manner similar to the natural agonist, which is the reference protein.

Similarly, reference to an “antagonist” refers to any compound which inhibits (e.g., antagonizes, reduces, decreases, blocks, reverses, or alters) the effect of a given agonist of a protein (including the protein itself) as described above. More particularly, an antagonist is capable of acting in a manner relative to the activity of the protein, such that the biological activity of the natural agonist or reference protein, is decreased in a manner that is antagonistic (e.g., against, a reversal of, contrary to) to the natural action of the protein. Such antagonists can include, but are not limited to, a protein, peptide, or nucleic acid (including ribozymes, RNAi, aptamers, and antisense), antibodies and antigen binding fragments thereof, or product of drug/compound/peptide design or selection that provides the antagonistic effect. Homologues of a given protein such as Betacam, including peptide and non-peptide agonists and antagonists (analogs), can be products of drug design or selection and can be produced using various methods known in the art. Such homologues can be referred to as mimetics. Various methods of drug design, useful to design or select mimetics or other therapeutic compounds useful in the present invention are disclosed in Maulik et al., 1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., which is incorporated herein by reference in its entirety.

An isolated protein useful as an antagonist or agonist according to the present invention can be isolated from its natural source, produced recombinantly or produced synthetically.

As used herein, a mimetic refers to any peptide or non-peptide compound that is able to mimic the biological action of a naturally occurring peptide, often because the mimetic has a basic structure that mimics the basic structure of the naturally occurring peptide and/or has the salient biological properties of the naturally occurring peptide. Mimetics can include, but are not limited to: peptides that have substantial modifications from the prototype such as no side chain similarity with the naturally occurring peptide (such modifications, for example, may decrease its susceptibility to degradation); anti-idiotypic and/or catalytic antibodies, or fragments thereof; non-proteinaceous portions of an isolated protein (e.g., carbohydrate structures); or synthetic or natural organic molecules, including nucleic acids and drugs identified through combinatorial chemistry, for example. Such mimetics can be designed, selected and/or otherwise identified using a variety of methods known in the art. A mimetic can be obtained, for example, from molecular diversity strategies (a combination of related strategies allowing the rapid construction of large, chemically diverse molecule libraries), libraries of natural or synthetic compounds, in particular from chemical or combinatorial libraries (i.e., libraries of compounds that differ in sequence or size but that have the similar building blocks) or by rational, directed or random drug design. See for example, Maulik et al., supra.

In a molecular diversity strategy, large compound libraries are synthesized, for example, from peptides, oligonucleotides, carbohydrates and/or synthetic organic molecules, using biological, enzymatic and/or chemical approaches. The critical parameters in developing a molecular diversity strategy include subunit diversity, molecular size, and library diversity. The general goal of screening such libraries is to utilize sequential application of combinatorial selection to obtain high-affinity ligands for a desired target, and then to optimize the lead molecules by either random or directed design strategies. Methods of molecular diversity are described in detail in Maulik, et al., ibid.

In a rational drug design procedure, the three-dimensional structure of a regulatory compound can be analyzed by, for example, nuclear magnetic resonance (NMR) or X-ray crystallography. This three-dimensional structure can then be used to predict structures of potential compounds, such as potential regulatory agents by, for example, computer modeling. The predicted compound structure can be used to optimize lead compounds derived, for example, by molecular diversity methods. In addition, the predicted compound structure can be produced by, for example, chemical synthesis, recombinant DNA technology, or by isolating a mimetope from a natural source (e.g., plants, animals, bacteria and fungi). Maulik et al. also disclose, for example, methods of directed design, in which the user directs the process of creating novel molecules from a fragment library of appropriately selected fragments; random design, in which the user uses a genetic or other algorithm to randomly mutate fragments and their combinations while simultaneously applying a selection criterion to evaluate the fitness of candidate ligands; and a grid-based approach in which the user calculates the interaction energy between three dimensional receptor structures and small fragment probes, followed by linking together of favorable probe sites.

Also included in the present invention are antibodies and antigen binding fragments thereof that selectively bind to all or a portion (e.g., biologically active portion) Betacam, as well as the use of such antibodies and antigen binding fragments thereof in any of the methods described herein. Antibodies that selectively bind to a protein can be produced using the structural information available for the protein (e.g., the amino acid sequence of at least a portion of the protein). As used herein, the term “selectively binds to” refers to the specific binding of one protein to another (e.g., an antibody, fragment thereof, or binding partner to an antigen), wherein the level of binding, as measured by any standard assay (e.g., an immunoassay), is statistically significantly higher than the background control for the assay. For example, when performing an immunoassay, controls typically include a reaction well/tube that contain antibody or antigen binding fragment alone (i.e., in the absence of antigen), wherein an amount of reactivity (e.g., non-specific binding to the well) by the antibody or antigen binding fragment thereof in the absence of the antigen is considered to be background. Binding can be measured using a variety of methods standard in the art, including, but not limited to: Western blot, immunoblot, enzyme-linked immunosorbant assay (ELISA), radioimmunoassay (RIA), immunoprecipitation, surface plasmon resonance, chemiluminescence, fluorescent polarization, phosphorescence, immunohistochemical analysis, matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry, microcytometry, microarray, microscopy, fluorescence activated cell sorting (FACS), and flow cytometry.

According to the present invention, an “epitope” of a given protein or peptide or other molecule is generally defined, with regard to antibodies, as a part of or site on a larger molecule to which an antibody or antigen-binding fragment thereof will bind, and against which an antibody will be produced. The term epitope can be used interchangeably with the term “antigenic determinant”, “antibody binding site”, or “conserved binding surface” of a given protein or antigen. More specifically, an epitope can be defined by both the amino acid residues involved in antibody binding and also by their conformation in three dimensional space (e.g., a conformational epitope or the conserved binding surface). An epitope can be included in peptides as small as about 4-6 amino acid residues, or can be included in larger segments of a protein, and need not be comprised of contiguous amino acid residues when referring to a three dimensional structure of an epitope, particularly with regard to an antibody-binding epitope. Antibody-binding epitopes are frequently conformational epitopes rather than a sequential epitope (i.e., linear epitope), or in other words, an epitope defined by amino acid residues arrayed in three dimensions on the surface of a protein or polypeptide to which an antibody binds. As mentioned above, the conformational epitope is not comprised of a contiguous sequence of amino acid residues, but instead, the residues are perhaps widely separated in the primary protein sequence, and are brought together to form a binding surface by the way the protein folds in its native conformation in three dimensions. Accordingly, the present invention includes proteins or peptides comprising, consisting essentially of, or consisting of Betacam epitopes, as well as antibodies, antigen-binding fragments, or other binding partners (binding peptides) that bind to any epitope of a Betacam protein. An “isoepitope”, according to the invention, is an epitope that exists in variant forms or isoforms (naturally or by synthetic design), such as an epitope containing a polymorphic variant amino acid position. One of skill in the art can identify and/or assemble conformational epitopes and/or sequential epitopes using known techniques, including mutational analysis (e.g., site-directed mutagenesis); protection from proteolytic degradation (protein footprinting); mimotope analysis using, e.g., synthetic peptides and pepscan, BIACORE or ELISA; antibody competition mapping; combinatorial peptide library screening; matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry; or three-dimensional modeling. For example, one can use molecular replacement or other techniques and the known three-dimensional structure of a related protein to model the three-dimensional structure of Betacam and predict the conformational epitope of antibody binding to this structure. Indeed, one can use one or any combination of such techniques to define the antibody binding epitope.

Antibodies useful in the present invention can include polyclonal and monoclonal antibodies, divalent and monovalent antibodies, bi- or multi-specific antibodies, serum containing such antibodies, antibodies that have been purified to varying degrees, and any functional equivalents of whole antibodies. Isolated antibodies of the present invention can include serum containing such antibodies, or antibodies that have been purified to varying degrees. Alternatively, functional equivalents of whole antibodies, such as antigen binding fragments in which one or more antibody domains are truncated or absent (e.g., Fv, Fab, Fab′, or F(ab)2 fragments), as well as genetically-engineered antibodies or antigen binding fragments thereof, including single chain antibodies or antibodies that can bind to more than one epitope (e.g., bi-specific antibodies), or antibodies that can bind to one or more different antigens (e.g., bi- or multi-specific antibodies), may also be employed in the invention.

Genetically engineered antibodies include those produced by standard recombinant DNA techniques involving the manipulation and re-expression of DNA encoding antibody variable and/or constant regions. Particular examples include, chimeric antibodies, where the VH and/or VL domains of the antibody come from a different source to the remainder of the antibody, and CDR grafted antibodies (and antigen binding fragments thereof), in which at least one CDR sequence and optionally at least one variable region framework amino acid is (are) derived from one source and the remaining portions of the variable and the constant regions (as appropriate) are derived from a different source. Construction of chimeric and CDR-grafted antibodies are described, for example, in European Patent Applications: EP-A 0194276, EP-A 0239400, EP-A 0451216 and EP-A 0460617. Generally, in the production of an antibody, a suitable experimental animal, such as, for example, but not limited to, a rabbit, a sheep, a hamster, a guinea pig, a mouse, a rat, or a chicken, is exposed to an antigen against which an antibody is desired. Typically, an animal is immunized with an effective amount of antigen that is injected into the animal. An effective amount of antigen refers to an amount needed to induce antibody production by the animal. The animal's immune system is then allowed to respond over a pre-determined period of time. The immunization process can be repeated until the immune system is found to be producing antibodies to the antigen. In order to obtain polyclonal antibodies specific for the antigen, serum is collected from the animal that contains the desired antibodies (or in the case of a chicken, antibody can be collected from the eggs). Such serum is useful as a reagent. Polyclonal antibodies can be further purified from the serum (or eggs) by, for example, treating the serum with ammonium sulfate.

Monoclonal antibodies may be produced according to the methodology of Kohler and Milstein (Nature 256:495-497, 1975). For example, B lymphocytes are recovered from the spleen (or any suitable tissue) of an immunized animal and then fused with myeloma cells to obtain a population of hybridoma cells capable of continual growth in suitable culture medium. Hybridomas producing the desired antibody are selected by testing the ability of the antibody produced by the hybridoma to bind to the desired antigen.

In particular aspects, the invention is directed to an antibody that has binding specificity (e.g., epitopic specificity) for the Betacam polypeptide. In one embodiment, the invention is directed to an antibody that has binding specificity for a polypeptide that comprises, consists essentially of, or consists of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO: 2 or amino acids 30 through 463 of SEQ ID NO: 3. In another embodiment, the antibody has binding specificity for amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO: 2 or amino acids 30 through 463 of SEQ ID NO: 3. In another embodiment, the antibody has binding specificity for a polypeptide that comprises, consists essentially of, consists of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15. In yet another embodiment, the antibody has binding specificity for SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15.

Betacam proteins, homologues (including altered peptides), fragments, peptides, peptide and non-peptide mimetics, and antibodies and antigen-binding fragments thereof can be included in compositions and formulations. Such compositions, or formulations, can include a pharmaceutically acceptable carrier, which includes pharmaceutically acceptable excipients and/or delivery vehicles. As used herein, a pharmaceutically acceptable carrier refers to any substance suitable for delivering a composition, formulation or vaccine useful in the method of the present invention to a suitable in vivo or ex vivo site. Preferred pharmaceutically acceptable carriers are capable of maintaining the agent to be delivered (e.g., Betacam proteins, homologues (including altered peptides), fragments, peptides, peptide and non-peptide mimetics, and antibodies and antigen-binding fragments thereof) in a form that, upon arrival of the agent to a target cell or target site, the agent is capable of acting at that cell or site (e.g. capable of inducing an immune response). Suitable excipients of the present invention include excipients or formularies that transport or help transport, but do not specifically target an agent to a site (also referred to herein as non-targeting carriers). Examples of pharmaceutically acceptable excipients include, but are not limited to water, phosphate buffered saline, Ringer's solution, dextrose solution, serum-containing solutions, Hank's solution, other aqueous physiologically balanced solutions, oils, esters and glycols. Aqueous carriers can contain suitable auxiliary substances required to approximate the physiological conditions of the recipient, for example, by enhancing chemical stability and isotonicity. Suitable auxiliary substances include, for example, sodium acetate, sodium chloride, sodium lactate, potassium chloride, calcium chloride, and other substances used to produce phosphate buffer, Tris buffer, and bicarbonate buffer. Auxiliary substances can also include preservatives, such as thimerosal, m- or o-cresol, formalin and benzol alcohol. Compositions of the present invention can be sterilized by conventional methods and/or lyophilized. One type of pharmaceutically acceptable carrier includes a controlled release formulation that is capable of slowly releasing a composition of the present invention into an animal. As used herein, a controlled release formulation comprises an agent useful in the present invention in a controlled release vehicle. Suitable controlled release vehicles include, but are not limited to, biocompatible polymers, other polymeric matrices, capsules, microcapsules, microparticles, bolus preparations, osmotic pumps, diffusion devices, liposomes, lipospheres, and transdermal delivery systems. Suitable delivery vehicles also include, but are not limited to liposomes, viral vectors or other delivery vehicles, including ribozymes. Natural lipid-containing delivery vehicles include cells and cellular membranes. Artificial lipid-containing delivery vehicles include liposomes and micelles. A delivery vehicle of the present invention can be modified to target to a particular site in a patient, thereby targeting and making use of an agent at that site. Suitable modifications include manipulating the chemical formula of the lipid portion of the delivery vehicle and/or introducing into the vehicle a targeting agent capable of specifically targeting a delivery vehicle to a preferred site, for example, a preferred cell type.

The invention also extends to PET-tracer (positron emission tomography) methodology, related to the formulation of compounds derived from the Betacam sequence, which are traceable using PET methodology. A biomolecule that serves as a marker for some function can be labeled with an isotope that emits positrons—subatomic particles akin to positively charged electrons—such as 11C, 13N, and 15O, as well as 18F, which often replaces 1H. Upon tracer administration, emitted positrons interact with electrons from atoms in nearby tissues, usually within 1 mm. The collisions result in annihilation events that each simultaneously liberate two gamma rays at 180°. A scintillator (bismuth germanate is standard) and photomultipliers in the tomograph encircling the subject detect this energy pattern and extrapolate the approximate origin of the energy. About 500,000 annihilation events constitute a single slice or scan of body tissues. Scans allow visualization of the internal locations of the tracers, designed for measuring general body functions (e.g., glucose metabolism or blood flow in an organ) or highly specific functions (e.g., occupancy of a subtype of brain dopamine receptor). The intensity of emitted gamma radiation is proportional to the tracer concentration. Anatomical considerations and correlation of the range of radiation densities with colors of the spectrum provide quantitative and/or color representations of the magnitude of a physiological function. In humans, typical tissue resolution with PET is 4-7 mm—the width of a pencil. The potential for development of a PET system was realized in the 1950s, when interest in positron-emitting isotopes emerged. Invention of the PET scanner in the early 1970s is credited to Michael E. Phelps.

PET tracers can be divided into three broad categories based on what they measure, as described at http://pubs.acs.org/subscribe/journals/tcaw/10/i10/html/10mckenna.html. The first type of tracer provides general metabolic data, such as glucose uptake and protein synthesis, via labeled biomolecules (e.g., 11C-deoxyglucose and 11C-methionine, respectively) that leave the bloodstream and enter cells. The second type provides estimates for grosser physiological parameters, such as blood flow (e.g., 15O—H2O or 11CO2), and essentially remains in the bloodstream over the effective study duration. The third tracer type delineates and quantifies highly specific molecular targets, such as cellular receptors and transporters, for which tracers are either endogenous ligands or drugs (e.g., 11C-raclopride for the DA2 dopamine receptor). The high specific activity and sensitivity of PET tracers make them well suited for studying molecular targets present to low nanomolar concentrations. The design or selection of the optimal PET tracer integrates knowledge of cellular physiology and biochemistry, advances in radio- and synthetic chemistry, tracer pharmacology and kinetics, and refinements in cyclotron and PET technologies. Successful PET imaging requires consideration of the innate properties of both radioisotope and tracer molecule, and the route of tracer administration. Radioisotope selection criteria must include the ability to incorporate the isotope into the molecule of interest and the appropriateness of its radioactive half-life for the study design. Half-lives of positron emitters range from 20 min (11C) to 110 min (18F). All steps, from cyclotron generation to subject administration, must occur within the useful lifespan of the label—approximately 3 half-lives—to maximize the signal-to-noise ratio. Ideally, PET tracers have properties that accurately reflect what they are meant to measure and minimize radiolabeling effects on the parent molecule. Such properties include high target-site selectivity, specificity, sensitivity, minimal metabolism, and the attainment of equilibrium in the body during the study Inherent in tracer design is consideration of the tracer administration route. Intravenous tracers, by virtue of their rapid delivery, systemic distribution, and bypassing of gastrointestinal metabolism, are ideal and are most common. Inhalation, the second most prevalent administration route, suffers from complications related to swallowing, dispersal, physicochemical interactions between drug and vehicle, and patient-related issues. Ingested tracers are characterized by slower absorption, systemic distribution, and a greater chance of metabolites forming, making them much less attractive.

PET methodologies for detecting cells in-vivo: cancer. Early diagnosis and treatment of cancer are often crucial to a good prognosis. PET allows the detection of cancerous cells before tumors even form and can sometimes obviate the need for biopsies. PET's ability to detect tumors, determine malignancy and cancer progression, and ascertain cancer metastases stems from its capacity to assess the relatively higher energy needs of actively growing cancer cells. In particular for the present invention, the use of PET technology for detecting beta cells, is not dissimilar to that use of the PET technology of detecting cancerous cells. Glucose is in higher demand as an energy source by rapidly dividing cancer cells than by normal cells. Using a 11C- or 18F-labeled 2-deoxyglucose tracer (a non-metabolized glucose analogue), PET can detect cancer and establish a baseline tumor growth rate (the glucose utilization rate) in a patient. It can also assess antitumor activity during and after therapy. Successful therapy depends on eliminating tumor growth (metabolism), which is determined by decreased glucose uptake by tumor cells. Radiolabeled amino acids can be used in a similar way to deoxyglucose. Other indices of tumor growth, such as the extent and rate of tumor perfusion, and their projected decreases with treatment, can also be determined.

Cardiovascular Disease. PET has proved useful in the study and quantification of various aspects of heart and blood vessel function. As with cancer, clinical studies show an important role for PET in diagnosing patients, describing disease, and developing treatment strategies. PET has been applied in two major areas: assessment of coronary artery disease and impaired blood flow, and determination of the viability of heart tissue for revascularization. The latter helps physicians decide whether bypass surgery or heart transplant is a more viable option for a patient. Tracers that assess blood flow include 15O—H2O, 11CO2, and 13NH3, help establish the extent and progression of arterial blockage as well as the efficacy of drug therapy or surgery. They also are used to monitor the recovery and maintenance of a blockage-free state.

Central Nervous System. Conditions. PET can be used to diagnose functional brain disorders, such as Alzheimer's and Parkinson's diseases, childhood seizures, brain development disorders, and brain tumors. Cause and effect can also be investigated. In memory loss, PET can ascertain whether the loss is due to decreased blood flow, depression, or a molecular depletion, as in Alzheimer's disease. In addition, the appropriateness of therapies or interventions for these disorders can be monitored. PET even maps brain regions involved in specific activities, such as laughing, hearing, memory, and emotions, a useful function for planning neurosurgical procedures. PET also can measure the effects of drugs on region-specific brain functions. For a given drug, the capacity and occupancy of brain receptor molecules—the sites of action of antipsychotic drugs—and transporter molecules—associated with drug addiction and drugs of intervention—can be assessed. Tracers that bind to these molecules generate regional maps of receptors and transporters, estimate their occupancy by drugs of interest, and correlate drug occupancy with degrees of clinical efficacy. Examples of PET Radiocompounds include:

-   -   15O-oxygen Oxygen metabolism     -   15O-carbon monoxide Blood volume     -   15O-carbon dioxide Blood flow     -   13N-ammonia Blood flow     -   18F-fluorodeoxyglucose Glucose metabolism     -   18F-fluoromisonidazole Hypoxic cell tracer     -   11C-SCH23390 Dopamine DI receptor     -   11C-flumazenil Benzodiazepine receptor

Particular aspects of the invention relate to a specific homologous gene group, referred to as the Betacam group of genes. Aspects of the present invention also include a variety of methods that make use of the identification of Betacam as a novel cell surface determinant of pancreatic beta cells.

A current focus on developing methods to measure beta cells in vitro and/or in vivo is highly prioritized by the juvenile diabetes foundation, as well as the NIH through particular funding mechanisms. However, such reagents and/or methods would not be limited to screening of pre-onset diabetes development in otherwise healthy subjects. The application extends to tracking pancreatic insulin cells in multiple scenarios, ranging from applications related to increasing the success of islet cell transplantation, the generation of novel beta cells from non-islet sources, and the purification of pancreatic beta cells from heterogeneous cell populations of various kinds Such considerations relate to finding cell surface determinants present on normal beta cells, simultaneously absent from most other cells, as this criterion of specificity is an absolute requirement for further successful development of both such a reagent type, and the methodology related to its application.

Some considerations as to the importance and significance of succeeding in the above ventures are as follows. Diabetes is presently incurable, and islet transplantation is only offered to very few individuals. Yet, envisioning the cure is simple: replenishment of the lost cell pool of insulin-producing cells (type I diabetes, juvenile form), or restoration of insulin producing cells in late-onset diabetes (type II diabetes), as their function has deteriorated to a level leading to incomplete control of glycaemia. The availability of a universal cell replacement source for insulin-producing cells would have significant impact on total people suffering from the disease, and lowering of health-care associated expenses. Likewise, if early diagnosis of pre-clinical symptoms of diabetes could be achieved, it would be expected that total numbers of patients would be reduced given early intervention.

One aspect of the invention is directed to a method of detecting beta cells in a mixture of pancreatic cells comprising detecting the presence of a polypeptide on the surface of the cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, and detection of expression of the polypeptide on the surface of the cells indicates that the cells are pancreatic beta cells. The method can further comprise isolating the pancreatic beta cells from the mixture of cells.

The mixture of pancreatic cells can be present in a variety of milieus. In one embodiment, the mixture of pancreatic cells are present in a biological sample. Examples of biological samples include pancreatic tissue (e.g., pancreatic donor tissue from, e.g., from a cadaver).

As will be appreciated by one of skill in the art, the Betacam polypeptide present on the surface of the beta cells can detected using a variety of reagents, and in particular labeled or tagged reagents, and methods for detecting such reagents. In one embodiment, an antibody that has binding affinity for the Betacam polypeptide is used. In another embodiment, since as shown herein homotypic interactions of Betacam/Betacam occur, a Betacam polypeptide can be used to detect the expression of Betacam on the surface of a pancreatic beta cell.

Particular aspects of the invention include the detection of pancreatic beta cells in pancreatic material obtained from a human cadaver. An application of the invention for this purpose could include contacting a labeled Betacam derived polypeptide, a Betacam reacting antibody, or a small engineered molecule (Betacam-derived reagent) designed to bind the Betacam protein surface, to a mixture of pancreatic beta cells (e.g., crude fractions of pancreatic cell suspensions, originating from donor pancreata).

It is known to those skilled in the art, that Betacam-derived molecules (i.e., nucleic acid, polypeptide) can be artificially labeled with multiple technologies, including fluorescent molecules, radioactive molecules (e.g., radioactive nucleide), enzymatic components (e.g., enzymes), select Tag sequence, PET-tracers, NMR tracers, or a drug for use in the methods described herein. Following contacting the cells with the molecule defined by the invention, measurements of labeling can be performed to assess islet cell purity as a function of total cell content based on the labeling component selected.

Crude isolated fractions of human islet preparations are often used for transplantation purpose for alleviating diabetic symptoms for extended periods (>1 year) in recipient individuals based on the more recent technology of islet cell transplantation as defined by the now commonly known “Edmonton protocol”. Enhancements of this particular protocol is envisioned as a particular application of the present invention. More specifically, an application of the invention would include tracking such islet cell preparations post-transplantation, as afforded by pre-contacting the islet cell preparation to a trackable Betacam-derived formulation, consequently labeling such and islet cell preparation. Considering that the trackable Betacam-derived reagent would facilitate non-invasive measurement of the transplanted cell population, methods for optimizing grafting methodology can be envisioned. Also, assessment of grafting, or transplantation, effectiveness, can be measured. This may be done very likely early following transplantation. Considering a certain stability of the Betacam-derived reagent, it may be possible to monitor the viability of the transplanted cells, which would be a benefit, as such measurements are not possible with existing technology.

Another aspect of the invention is directed to a method of detecting pancreatic beta cells in an individual in need thereof, comprising administering to the individual an agent that detects the presence of a polypeptide on the surface of the pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15. This method can be used, for example, to determine whether an individual is at risk of developing diabetes, or to assess the beta cells of an individual that has diabetes (e.g., to determine the appropriate treatment needed for a diabetic patient or to assess the efficacy of a diabetic patient's existing treatment). In one embodiment, the individual has Type I diabetes, and in another embodiment, the individual has Type II diabetes. In yet another embodiment, the individual has had an islet cell transplantation.

Another particular embodiment would apply a Betacam-derived formulation as described above for injection intravenously into the bloodstream after which contact to a beta cell surface would occur. Upon binding of said molecule to the surface to beta cells, and the emittance of a signal based on a trackable moiety, beta cell mass may be measured in a patient separately from the parameters of glucose dependent insulin secretion assays, and separate from oral glucose tolerance tests, which reflects on basal islet cell functionality, but not total beta cell mass. As such, the invention may be used to develop non-invasive assays for detecting various degrees of beta-cell loss in a human individual, which are of relevance for prognosis of disease. During the progression of type I diabetes, prior to diagnosis of the disease, an ongoing autoimmune attack is known to gradually eliminate the beta cell population. Similarly, the detection of a progressive deterioration of the beta-cell mass, as it gradually is lost in type II pre-diabetic individuals, is of clinical relevance. Consequently, for either consideration, detecting an ongoing beta cell loss may be of significant value in guiding decisions of prediction, and prevention, of type I and type II diabetes, based on earlier intervention.

In yet another method, the invention includes the development of a reagent capable of purification of pancreatic beta cells from human pancreatic donor material. More particularly, a Betacam-derived polypeptide, a Betacam reacting antibody, or a small engineered molecule, would be contacted to crude fractions of pancreatic cell suspensions, originating from donor pancreata. If said Betacam-derived polypeptide; Betacam reacting antibody; or a small engineered molecule was previously conjugated or otherwise stably connected to a ligand, affinity-Tag moiety, or fusion protein domain which allows binding to a support material (e.g., plastic dish, plastic tube, sutures, membranes, ultra thin films, bioreactors, microparticles) or suitable matrix (e.g., polymeric matrix), cellular fractional enrichment either through centrifugal spinning, gravitational force, magnetic bead cell adhesion, flow-sorting or other fluid-pressure methodologies, enrichment of the Betacam-expressing cell population, including pancreatic beta cells could be achieved. Such enrichment would be expected to favorably improve on current transplantation clinical outcomes.

In a particular aspect, the invention is directed to a method of isolating pancreatic beta cells from a mixture of pancreatic cells comprising contacting the mixture with a reagent that specifically binds to a polypeptide present on the surface of pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, thereby producing a combination. The combination is maintained under conditions in which the reagent binds to the polypeptide present on the surface of the pancreatic beta cells, thereby producing a complex of pancreatic beta cells bound to the reagent; and the complex is separated from the combination, thereby isolating pancreatic beta cells from the mixture of pancreatic cells. The method can further comprise separating the pancreatic beta cells from the reagent.

In yet another method, the invention includes the development of technology leading to improved characterization of fraction enrichment of pancreatic beta cells from a heterogenous source of cells, including forward differentiated human embryonic stem cells. Current emphasis is presently on developing a universally available islet cell replacement cell resource, and particular efforts are directed on using human embryonic stem cells as a starting material. It is also known that such cells are pluripotent, and can adopt multiple cellular fates upon entering a differentiation process, which is controlled by an investigator. The promise of such cells is offset by the difficulties in specific directed differentiation method, which at present do not lead to a pure cell population of pancreatic beta cells. Other problems relate to the development of teratoma-type tumors upon transplantation to a live host. This particular problem is accredited to the co-transplantation of a limited set of undifferentiated, pluripotent, stem cells, existing along the more differentiated progeny. The consequence is detrimental, as such latent tumor forming capacity is posing a significant danger to a potential recipient. A solution to the problem would be to purify the insulin producing cell population to a level where such cells are not present. Therefore, in particular, one method of the invention would be to contact a Betacam-derived reagent to a forward-differentiated embryonic stem cell population, and purifying the insulin producing, Betacam expressing cells from contaminating non-endocrine cell types. Measures of purification capacity can be given in relative insulin expression per cell, or per DNA weight, as examples. Another measure of the purification can be given in the relative reduction o tumor forming capacity as events per million cells transplanted.

Accordingly, one, or more, embodiments of the invention relate to a method related to development of improved methods whereby the purification of pancreatic beta cells from any heterogenous source of cells can be achieved. It is known to those skilled in the arts that endocrine cells, including that of the pancreatic insulin-type, may possibly be derived from non embryonic stem cell sources. Regarding the emerging technologies of creating a universal cell source for diabetes treatment, these cover a wide area of investigative entries. Possible cell sources investigated as a means to this end includes in addition to human embryonic stem cells (hES), also hematopoietic stem cells (HSC), mesenchymal stem cell (MSC), multipotent adult progenitor cells (MPAC), pancreatic progenitor cells hPPCs), non-endocrine pancreatic epithelial cells (NEPECs), adult liver cells (Liver), adult GIP cells (K-cells), adult human duct cells (hDuct), adult human exocrine cells (hExocrine), genetically programmed transformed islet tumor cells, porcine embryonic pancreas (PEP), porcine islet cells (PIC), and ex-vivo expanded human islet cells. In all cases, the issue of initial clonal heterogeneity is a concern, and the end-point always defined by increasing the homogeneity of the end stage cell population. This invention relates to a method whereby purification of pancreatic beta cells from any such original heterogenous source of cells can be achieved.

Regarding the emerging technologies of creating a universal cell source for diabetes treatment, these cover a wide area of investigative entries, all having a similar end-point in common. The endpoint would be a glucose-responsive, insulin-producing cell, capable of being grafted into a human recipient. Possible cell sources investigated as a means to this end includes human embryonic stem cells (hES), hematopoietic stem cells (HSC), mesenchymal stem cell (MSC), multipotent adult progenitor cells (MPAC), pancreatic progenitor cells (PPCs), non-endocrine pancreatic epithelial cells (NEPECs), adult liver cells (Liver), adult GIP cells (K-cells), adult human duct cells (hDuct), adult human exocrine cells (hExocrine), genetically programmed transformed islet tumor cells, porcine embryonic pancreas (PEP), porcine islet cells (PIC), and ex-vivo expanded human islet cells. Other cell sources have been mentioned in literature, and the above list is not exhaustive, nor meant to be. For the remainder of the disclosure, all such cells are commonly referred to as a “progenitor cell source” (PCS), not requiring that such cells are defined as “progenitors” in the strict meaning of the word as employed by those skilled in the arts, but more generally applying the semantic use of “progenitor” as being a cell capable of change into another type, in this case pancreatic insulin-producing cells. Similarly, regarding the semantic use of “pancreatic beta cell” in this disclosure is not limited to the strict definition of a pancreatic beta cell by those skilled in the arts, but for the remainder of the disclosure encompasses any cell type capable of producing insulin, and secreting this hormonal product in response to extra-cellular glucose, which is hereby defined as the minimal set of requirements.

Another aspect of the invention is a method of identifying an agent that modulates (e.g., inhibits; enhances) the biological activity of betacam comprising contacting a composition comprising a polypeptide, wherein the polypeptide has an amino acid sequence comprising SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15 with an agent to be assessed. The biological activity of the polypeptide in the presence of the agent is measured and compared to a suitable control, wherein if the polypeptide modulates the activity of the polypeptide in the presence of the agent compared to the control, then the agent modulates the biological activity of betacam. In a particular embodiment, the composition is one or more pancreatic beta cells. A any biological activity of betacam can be measured such as homotypic cell adhesion between betacam-expressing pancreatic beta cells. In addition, as will be appreciated by those of skill in the art, a variety of suitable controls are available for use in the method. In one embodiment, the control comprises pancreatic beta cells which have not been contacted with the agent to be assessed.

The following examples are provided for the purpose of illustration and are not intended to limit the scope of the present invention. Each publication or other reference disclosed below and elsewhere herein is incorporated herein by reference in its entirety.

Example 1 Identification of Betacam as a Novel Cell Surface Molecule

In the following, a description of the amino acid sequence of the protein Betacam is provided. Homologous regions were easily detected for multiple species including human, pan troglodytes, canis familiaris, rat rattus, and gallus gallus, and several other species. Otherwise specified, the amino acid sequence numbering is in the following referring to that of mouse Betacam protein, isoform 1 [SEQ ID NO: 3], encoding a total of 463 amino acids.

In particular, the inventors named the gene locus in mouse, corresponding to Unigene Mm.206911, FIG. 1, Betacam. The encoded protein was henceforth named Betacam. Its NCBI gene ID is 101202. The current name of the gene as used by NCBI is “expressed sequence AI987662” (FIG. 1). The inventors name the gene locus LOC253012 in humans, its GeneID: 253012 (FIG. 2 and FIG. 3), and its corresponding to UniGene Hs.443169, Betacam as well. The inventors name the protein encoded by the human GeneID 253012 Betacam.

These above mentioned Betacam genes are part of a homologous gene group, conserved during evolution, and NCBI refers to this group as HomoloGene 18724 (FIG. 4). This group was recently named HEPACAM2 based on its weak similarity to the HEPACAM orthologous group. Betacam is a conserved gene between vertebrate species to the extent that it represents a single orthologous group. The closest immediate neighbor is represented by Homologene group 17652, HEPACAM, which displays <40% amino acid identity to the Betacam group.

The Betacam orthologous group contains multiple members in a large diversity of living species. The following sequence identities refer to individual members. SEQUENCE IDENTITY 1 [SEQ ID NO: 1] is human Betacam, isoform 1 (Homo Sapiens). [SEQ ID NO: 2] is human Betacam isoform 2 (Homo Sapiens). [SEQ ID NO: 3] is mouse Betacam isoform 1 (Mus Musculus). [SEQ ID NO: 4] is rat (rat rattus) Betacam. [SEQ ID NO: 5] is Bovine Betacam (Bos Taurus). [SEQ ID NO: 6] is dog Betacam (Canis familiaris). [SEQ ID NO: 7] is for Zebrafish Betacam (Danio Rerio). [SEQ ID NO: 8] is for horse Betacam (Equus caballus). [SEQ ID NO: 9] is for chicken Betacam (gallus gallus). [SEQ ID NO: 10] is for Chimpanzee Betacam (Pan troglodytes), [SEQ ID NO: 11] is for Macaque Monkey Betacam (Macaca Mulatta). [SEQ ID NO: 12] is for pufferfish Betacam (Tetraodon nigriviridis). [SEQ ID NO: 13] is for fruit fly (Drosophila Melanogaster) Lachesin.

The naming selected is one of convenience; none of the above genes/proteins are named based on previous knowledge of function, or expression in the liver. The arbitrary selection of gene/protein name Betacam is based on information provided within, where the inventors show that the protein is a member of a cell adhesion family group (-cam extension), and selectively expressed in pancreatic beta cells (beta-). The invention in particular is based on published knowledge that no other existing and described cell surface marker is known with a similar specificity of expression, as that displayed by the Betacam-encoded protein Betacam.

Betacam in humans exists in two alternative forms, isoform 1 (FIG. 5), and isoform 2 (FIG. 6). The difference between these forms lies in alternative exon-1 usage, explained by differential promoter usage. The predicted sequence of either form of the encoded proteins differs in the amino terminal sequence. Translation of isoform 1 gives a protein of 462 amino acids [SEQ ID NO: 1], translation of isoform 2 gives a protein of 450 amino acids [SEQ ID NO: 2].

In humans, Betacam resides on Chromosome 7, and is encoded on the reverse strand. It consists of 9 exons. (FIG. 7). This image was created using the genome browser “Ensembl” (www.ensembl.org).

A protein-Blast analysis of human Betacam protein sequence, isoform 1 against known human protein sequences was performed. This was done in order to assess which other protein share similarity to Betacam within the same species. A tree view is shown in FIG. 8, in which the Betacam protein is shown at bottom. It was observed that the closest homologues to Betacam are Follistatin-like 4, hemicentin, the CEACAM group, and finally Hepacam. The phylogenetic distance to Hepacam argued that Betacam/Hepacam2 are functionally distinct proteins.

As shown herein, mouse Betacam contained a clearly detectable signal peptide (FIG. 9). The confidence in predicting cleavage is 1.0. Cleavage will occur following amino acid Glycine 31. This occurs upon ER-docking, and the signal peptide recognition particle will facilitate docking of the nascent peptide chain emerging from a translating ribosome. Membrane-type components are known to contain signal peptides, in order to get routed to the membrane.

As also shown herein, human Betacam isoform 1 contains a clearly detectable signal peptide (FIG. 10). The confidence in predicting cleavage is 1.0. Cleavage will occur following amino acid Glycine 31.

Also shown herein, Betacam isoform 2 contained a clearly detectable signal peptide (FIG. 11). The confidence in predicting cleavage is 1.0. Cleavage will occur following amino acid Glycine 19, which is identical to Glycine 31 in isoform 1. The resulting protein of isoform 1, and isoform 2 types are consequently identical following pre-peptide cleavage.

Also shown herein is that the long form of Betacam from macacca mulatta [SEQ ID NO: 11], does not contain a signal peptide (FIG. 12), and the short form of Betacam from Macacca Mulatta does contain a signal peptide (FIG. 13).

Also shown herein is that Betacam contains a clearly detectable trans-membrane region, between amino acids proline 352 to tryptophan 373 (FIG. 14). Consequently, the protein encoded by Betacam is a single-pass trans-membrane spanning polypeptide, of from about amino acid leucine 32 to about serine 350 are extracellular in human Betacam, isoform 1 (SEQ ID NO: 1). Particular aspects of the invention relate to this extracellular portion of Betacam. The extracellular domain of Mouse Betacam is shown in SEQ ID NO: 14. The extracellular domain of Human Betacam is shown in SEQID NO: 15.

Referring now to the invention in more detail, the inventors show that Betacam contains a series of high-probability N-linked glycosylation asparagines residues. A prediction analysis was carried out using the Net-N-glyc neural-network based prediction server a Center for Biological Sequence analysis (www.cbs.dtu.dk). The consensus sequence is Asn-X-Ser/Thr, and such sequences are detected at 9 positions in the extracellular domain of Betacam (FIG. 15).

Example 2 Expression of Betacam on Pancreatic Insulin-Producing Beta Cells

Initial assessment of expression of Betacam using genomics-type data was performed (FIG. 16). Genomics data were either publicly available, or produced for the purpose of finding novel genes displaying beta-cell expression. To obtain data on expression, RNA was isolated for select tissues, or cell types. The resulting RNA was subjected to cDNA conversion, whereafter hybridization to select commercial genomics expression DNA-chips were performed. Such chips included types of Affymetrix, and Illumina commercial-type platforms. Expression data were normalized in software packages suitable for the need, including the GeneSpring analysis program and the Partek analysis program. Normalization of DNA chip scans were performed using the MAS5.0 algorithm for Affymetrix-type datasets, whereas Illumina type data sets were normalized using “average” normalization within the vendor-supplied program “BeadStudio”. Affymetrix scan data were all normalized to the arbitrary value of 500.

Oligonucleotide microarray experiments were performed on pancreatic-related samples using human U133 and mouse MOE430 Affymetrics chips that cover virtually the entire genome. Data was obtained from isolated islets from normal mice, diabetic models (NOD and ob/ob) and mice with deficiencies in the Ngn 3 as well as from mouse pancreatic tumor cell lines (aTC1-6 glucagonoma, βTC3 and Min6 insulinomas, and mPAC ductal tumor line). The data was analyzed to highlight transcripts that display islet cell-type-specific expression, and their segregation between pancreatic a- and β-cells.

Specifically, the advent of gene microarrays covering almost the complete spectrum of encoded mouse mRNAs (transcriptome) enabled the identification of the subsets of genes that are expressed in pancreatic islets. A number of published studies have documented genes that are expressed in pancreatic islet tissue, specific islet cell types and islet-derived cell lines (Shalev, 2002). In addition, studies have reported on the responses of islets to physiological and pathophysiological manipulation such as stimulation with glucose or inflammatory cytokines in vitro, and from mice carrying mutant genes that affect pancreatic function or development. The inventors performed more than 50 microarray experiments using both human U133 and mouse MOE430 oligonucleotide chips that report on virtually all transcripts from each species. This includes data from normal mice, diabetic models (NOD and ob/ob) and mice with deficiencies in Ngn3. Ngn3 null pancreas is completely devoid of pancreatic endocrine cells, and thus analysis at different gestational time points allowed the identification of transcripts that are highly expressed in the endocrine cells relative to exocrine and ductal tissue throughout development.

Further evaluation of tissue-specificity in-silico was made through queries against a larger series non-pancreatic type. Specifically, data were compared to array data obtained from a large non-pancreatic tissue pool of 45 tissue types (Novartis dataset and Unigene expression profiles). In addition, analysis of mouse pancreatic tumor cell lines (αTC1-6 glucagonoma, βTC3 and Min6 insulinomas, and mPAC ductal tumor line) further allowed the generation of predictive scores for select transcripts likely to display islet cell-type-specific expression, and their segregation between pancreatic α- and β-cells. These cell lines express genes related to the tumor cell phenotype and thus analyses were also performed on isolated pancreatic β-cell from a transgenic mouse expressing the autoantigen Phogrin linked to EGFP under the rat insulin 2 promoter. This resource was created by the inventors. The Table lists some of the genes for which transcripts were defined by ANOVA analysis, firstly as being differentially expressed in Ngn3 wild-type and knock-out mice at any embryological age (pancreatic endocrine and precursors) and secondly as being present in adult mouse islets. The list was then stratified on the basis of the relative expression in αTC and βTC cell lines. These approaches successfully predicted the islet cell specificity of the majority of the known transcriptional regulatory components involved in islet development, such as Ipf1, Arx, Pax4, Pax6, Brn4, NeuroD and known cell type specificity of several α- and β cell genes. Known neuroendocrine transcripts such as PTPRN (IA-2), prohormone convertases (Pcsk1, Pcsk2, Cpe) and the granins (Chga, Chgb Scg2, Sgne1) were in a pool of common αTC and βTC transcripts. Genes associated with other islet endocrine cells were, as predicted, not expressed in either (Ppy, Pyy, and ghrelin).

Referring now to the invention in more detail, the inventors describe in the following the discovery process of the Betacam gene. As an initial guide to the identification of pancreatic endocrine cell transcripts, E18.5 embryonic Ngn3 null pancreas was compared to WT littermate. This identified app. 180 individual transcripts that were absent in the endocrine-deficient pancreas, most of which correspond to known genes (the Table).

A limited number of transcript hits were previously uncharacterized, and further scrutinized by various prediction methods, results of which applying to AI987662/Betacam are shown in Example 1. Through this process, one particular gene, known as mouse gene AI987662 was discovered, which forms the basis of the particular aspects of the invention. Within the stratification method described in the Table, gene locus was observed within the pool of transcripts belonging to those >5-fold enriched in bTC cells versus aTC cells.

TABLE Gene transcripts that were depleted in the Ngn3 ko pancreas at e12.5, e15.5 or 18.5 were examined for expression in the endocrine cell lines αTC and βTC and adult islets. Components of islet endocrine cells are underlined; known autoantigens are highlighted in bold type. >5-fold enriched in αTC >5-fold enriched in βTC Not enriched in αTC or βTC Not expressed in αTC or βTC 1110005D19Rik 1100001E04Rik 7-Sep 9030612M13Rik 1810044E12Rik 1700040L02Rik 1110035L05Rik 9830160H19Rik 2310010I16Rik 2310014L03Rik 1700041C02Rik A430107J06Rik Abcc8 2310067E08Rik 2810431N21Rik 1810018P12Rik Aco1 Actr3 ank Aplp1 4731413G05Rik 2900052J15Rik 2010011I20Rik Atp1a1 AW011752 5133401E04Rik 6430527G18Rik 2310007H09Rik AW011752 Banf1 5730453H04Rik 6430527G18Rik 2610016M12Rik BC016198 BC042620 5930418K15Rik 7420452D20Rik 2700049B16Rik BC061928 C130083N04Rik 6430401D08 6720464I07Rik 9430022M17Rik 2900001G08Rik C230068E13 8430421H08Rik 9430023B20 3100002J23Rik C820002P14Rik Calm1 9030425P06Rik AA589382 9530058B02Rik 3110018A08Rik Capza2 Ccnb1 Ccnh Ccni Ace2 Acvr2 Apoa1 Arfgef1 A630013F22Rik Apoa2 Arx 3110050F08Rik Cda08-pending Cdc5I Asah2 B230312I18Rik B230206N24Rik 5830437M04Rik Cdkn2d Cgef2-pending BC027756 BC054438 Braf B230309E09Rik 5930418K15Rik Chga Chqb Chic1 Clcn3 C030034I22Rik B430319H24Rik Btg2 9330186A19Rik Cotl1 Cpe Csnk1d C130047D21Rik Cald1 Car2 CGI-141- 9830147J24Rik D16lum22e D7Ertd743e C130099L13Rik C1qb C3 pending Copg2as2 A530058N18Rik D9Wsu20e Ddx9 Donson C430010P07Rik Cdw92 D6Ertd253e Ednra Eno2 A930001M12Rik Dscr2 Emb Emb Foxa2 Ceacam2 Cfh Cpne3 Ctss Epb7.2 FBp2 Fev-pending A930009L07Rik Adcy7 Gna11 Gng5 H2-D1 Dnajc13 E130113K08Rik Foxf2 Galnt7 Gcg Gfra1 Adra2a AI173274 AI315068 Hdac2 Hmgcr Hmgn3 Ecm1 Enah Fabp1 Fabp4 Glcci1 Gpr30 Gstt2 Hes1 AI987662 Ang Asc-pending Hmgn3 Hnrpab Hnrpu Fbxl12 Fcgr2b Fgl2 Flt1 Hs3st1 Ier3 Irx2 Itih2 Kap Atp2a3 AW125421 Hspa5 lerepo4-pending Isl1 Foxa3 Frzb Gbp2 Gca LOC224093 Mttp Pde3a B630019K06Rik Khdrbs3 Kif11 Kif5b Ghrl H2-Ab1 Hba-a1 Pde3a Pou3f4 Rbp4 Rgs4 B930068K11Rik BCO26600 LOC218490 LOC226144 Homez Hpvc2 Il6ra Insrr Sbsn-pending Sdc4 Sdc4 BC052055 Bicc1 Bok Cat LOC231887 LOC240396 Jarid1c Klf9 LOC214424 Slc38a1 Soat1 Spp1 Tfpi Cav2 Cd44 Crip Crp Map3k7 Matr3 Matr3 L0C56628 Lyzs Lyzs Lyzs Tle6 Trf Ttr D930029E11Rik Dach2 Dcx MGC65558 MGC6694 Mapk14 MGC25863 Mglap 4930459B04Rik Ttyh1 Dpep1 Dpp4 Dscr1I1 Ebf3 Mrps16 Mtch2-pending Mta3 Narg2 Ndel1 Nedd9 Ttyh1 Vegfc Vldlr Zdhhc14 Eif2s3y Elovl2 F13a Ndr3 Neurod1 Nkx2-2 Nov Pah Pkhd1 Ppy Pyy Zfp52. Frabin-pending G6pc-rs Np95 Paxip1 Pcsk1n Rbp7 Ret Rgpr-pending Gch Gck Gipr Glp1r Pcsk2 Pctk1 Pfdn1 Pitpnb S100a6 S100a8 Scp2 Gna13 Gpr27 H2-D1 Prnp Psk1-pending Psmb3 Siat8c Sst Sycp3 Tacstd1 Hlxb9 Hpca Hspa12a- Ptprn Pttg1 Rab6 Rad21 Timp3 Tm4sf3 Tnfrsf11b pending Iapp Ins1 Ins2 Ramp2 Ranbp1 Rbpms Tor3a Tpra40-pending Insm1 Ipf1 Iqgap1 Krt2-8 Rcn2 Refbp2 Resp18 Trim44 Tsll2-pending Lgals2 Lhx2 Lmwdsp20- Risc-pending Rnpc2 Scq2 Usp15 Usp47 Utx Vcam1 pending LOC194126 Sdfr1 Sfrs3 Sgne1 Smc4l1 Waspip Zfp219 Zfp36l1 LOC215866 LOC328644 Spi1-1 Sqle Ssb Syt13 Zfp40 Zfpn1a2. Maob Mbc2 MGC47419 Syt7 Syt7 Tmpo Tomm20- Myo7b Necab2-pending pending Txnrd1 Ube1c Nmi Nnat Npy Nudt7 Ubl3 Ubqln2 Vdu1-pending Papss2 Papss2 Pclo Wwp4-pending Xlr3a Ppp1r1a Prcad-pending Ywhab Ywhaz Yy1 Prkcb Pvrl3 Ramp1 Rasd1 Zfp364. Rasgrf1 Sepp1 Slc12a7 Slc2a2 Slitl2 Stx3 Svil Sytl4 Tec Tnnt1 Ubap1 Wbscr14.

Using the additionally available genomics datasets, it was found that expression of Betacam occurred in both normal, and obese mouse pancreatic islet cells (FIG. 16). Expression of Betacam was observed in Flow-cytometry sorted (FACS) mouse pancreatic β-cells, as defined by phogrin-EGFP expressing cells at embryonic day E18.5, corresponding to a time when pancreatic beta cells are recently formed. Phogrin is selectively expressed by the pancreatic beta cell, and not other pancreatic endocrine cells. Expression of Betacam was observed in Flow-cytometry sorted (FACS) mouse pancreatic β-cells, as defined by phogrin-EGFP expressing cells at postnatal day 24 and 25, corresponding to weaning age, a time when the pancreatic β-cells are mature. Expression of Betacam was observed in pancreatic glucagonoma cells (αTC) at low levels, and much increased in pancreatic insulinoma cells (βTC), indicating beta-cell enrichment. Expression of Betacam was observed throughout the course of mouse pancreatic development, increasing at E13.5-E14.5, which is known as the secondary transition, and is the time when pancreatic beta cells start to form. Expression of Betacam was lost in mouse pancreas of Ngn3 null genotype, but present in WT littermates. Expression of Betacam in human islets was validated through EST-sequencing, and the Betacam mRNA was detected 15 times through random sequencing of human islet cDNA.

It was also found that Betacam is expressed in the pituitary, small intestine and large intestine, but not elsewhere as judged from a genomics screen of multiple tissues (FIG. 17).

It was also found that expression of Betacam as performed by GenePaint (www.genepaint.org), is observed in developing pancreas at E14.5, with a centrally regionalized expression pattern corresponding to that of developing endocrine cells (FIG. 18A). Outside the pancreas, expression is noted in limited regions of the developing brain (FIG. 18B). In situ hybridization was performed and revealed expression of Betacam in developing E18.5 pancreas, corresponding to endocrine cells (FIG. 18C). Radioactive labeling and RT-PCR analysis was performed and revealed expression of Betacam in pancreatic insulinoma cells (βTC). EST sequencing confirms the presence of Betacam mRNA in cDNA libraries from both mouse and man (FIG. 18D).

Example 3 Structural Prediction Analysis of the Folding Conformation of the Extra-Cellular Region of Betacam

The data on the predicted structural fold of the Betacam extracellular portion was based on a method known as “threading”, and provides to those skilled in the art, a confidence level basis for understanding an unknown 3-dimensional fold, as judged through comparisons to other similarly folding proteins. The resource that was applied to obtain these results is known as “Phyre”, provide by the imperial College, U. K. (http://www.sbg.bio.ic.ac.uk/phyre/). Uploading the amino acid sequence for mouse Betacam, the Phyre resource first performed a Psi-blast homology detection algorithm analysis using sequence identities for proteins for which a 3-dimensional structure has been solved, and is publicly available. Hits passing a minimal threshold were used for a “threading analysis” in which the novel protein sequence, in this case that of Betacam, was threaded through the fold libraries represented by the Psi-blast hit table. 3-dimensional “fits” were evaluated based on energy considerations, and if the unknown sequence is capable of adopting a fold structure similar to that of one, or more, of the library folds. The results were presented such that a 3D coordinate set for the unknown protein sequence, congruent to the library hit, was downloaded. A score, relating to the confidence of the particular prediction analysis was obtained. A detailed description of phyre is found within “Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre. By Bennett-Lovsey R M, Herbert A D, Sternberg M J E, Kelley L A. Proteins: Structure, Function, Bioinformatics, vol 70(3) 611-625 (2008).”

Based on results of above, it was found that the extracellular domain consists of 3 Ig-type domains. Based on 3-dimensional modeling to distant relatives using the Phyre threading service, the structural fold of 95% of the extracellular domain, ranging from amino acid leu32 to Isoleucine 306 was predicted, and was fitted to the NCAM 3-dimensional fold, represented by the structure known in the Brookhaven protein data bank as structure c1qz1A.pdb.

Based on above results, a homotypic interaction method for Betacam/Betacam protein dimers was predicted. The homotypic interaction basis allows for the specific design of Betacam-derived single-molecules that are capable of interfering with Betacam/Betacam interactions, or binding to Betacam proteins localized on the beta cell surface.

FIG. 19 is a multi-alignment of Betacam protein sequences from 5 selected vertebrate species upon which a domain-assignment was provided. 3 Ig-type domains were detected, and named D1, D2, D3, where D1 represents the domain located most N-terminally, D2 is between D1 and D3, and D3 occupies a position C-terminally, closest to the membrane. For convenience, the amino acid numbering for the structural analyses were provided based on cleavage of the signal peptide. Consequently, this trimmed off the first 32 amino terminal residues, and amino acid 1 (leucine) corresponds to amino acid 33 (leucine) when the signal peptide is present. Amino acid residues located at the junction between these domains are as follows: Pro114 is the bridging residue between D1 and D2, Tyrosine 207 and Tyrosine 208 bridge between D2 and D3 and Isoleucine 303 is the most C-terminally located residue in the fold prediction analysis. The sequence ³⁰⁴TSVGLEKLAQRGKSLS319 (SEQ ID NO: 51) corresponds to the remaining extracellular sequence, which is not modeled and predicted to behave as a linker between D3 and the transmembrane domain. A series of amino acids are present in Betacam, from Arginine 41 to Asparagine 54, which are not present in NCAM, and cannot be modeled. his 12 amino acid stretch (⁴²SHTMPKYLLGSV⁵³ (SEQ ID NO: 52)) is referred to as the “D1 loop”.

FIG. 20 is structural view of Betacam (left) compared to NCAM (right), force field view. No minimization occurred, and the conformation of Betacam was not stable. The image was created by the DeepView program, which is a freely available program for structural analysis of proteins, using pdb-coordinate sets. Information on the Deepview program, also known as the Swisspdb Viewer, is available at http://www.expasy.org/spdpv/.

FIG. 21 is structural view of Betacam (left) following energy minimization. A significant gain in energy was attained, the molecule was more stable, and side chains were in a more favorable conformation. The structure stability approaches that of NCAM (right)

FIG. 22 is structural view of Betacam (left) following energy minimization, where labeling of amino acids displaying incomplete conservation during evolution has been provided. Residues in D1 were generally conserved. A patch of residues, on the kink-region of D2 “D2-kink”, are diverging during evolution. This area of the molecule was expected to be solvent accessible, and unlikely to be involved in protein/protein interactions, as such interactions would require a stronger evolutionary fixation. A region/patch of amino acids in D3 is non-conserved.

FIG. 23 shows the position of highly probably N-linked glycosylation sites on the Betacam molecule as predicted using the Net-N-Glyc server. Such sites were found within the regions corresponding to D1, D2 and D3.

FIG. 24 shows the localization of the putative glycosylated asparagines shown in FIG. 24, with a rotation of the Betacam molecule. The two rightmost views display a one-sided, asymmetric localization of all of the Asparagine residues identified by the neural network prediction server. All of these were localized on the surface of the predicted fold. Suggestively, glycosylation protects one side of the Betacam protein. The other side is free to engage in protein/protein interactions. The leftmost view is identical to the middle representation of the Betacam molecule, except that surface-charge distribution has been visualized. It is clear that the D1/D2 area display an asymmetrical charge distribution.

FIG. 25 is a side-view of the Betacam molecule in FIG. 23, leftmost. This view emphasizes the asymmetrical distribution of charge (blue corresponds to positively charged areas, red corresponds to negatively charged areas). A dipole exists that covers the D1/D2 region. In the following, reference is made to the “acidic groove” present at the D1/D2 bridge region.

FIG. 26 shows with higher magnification the secondary structure elements of Betacam D1. This Ig-type domain consists of 7 beta-sheets, named in the figure as A-G. On the left-most side of the domain, amino acids Arginine 41 and Asparagine 54 are shown in yellow. The D1 loop, present between these residues, on the surface of D1, is the same surface as the one containing the glycosylated residues. In fact, Asparagine 54 is one of the putative glycosylated residues. The D1 loop is therefore not considered to interfere with protein/protein interactions occurring at the opposite side of the domain.

FIG. 27 shows the secondary structure of the Betacam molecule consisting of 3 Ig-domains, all of which consist of 7 beta-sheets. The bridging residues proline 114, and tyrosines 207-208 are shown.

FIG. 28 shows a magnified view of the basic region of D1, which is referred to as the “docking motif”. Two residues expected to insert into the acidic groove (Phenylalanine 26 and Histidine 27) are highlighted.

FIG. 29 shows a magnified view of the basic region of D1, which is referred to as the “docking motif”. Two residues, immediately neighboring those of Phenylalanine 26 and Histidine 27, Leucine 1 and Lysine 2, are highlighted. The Lysine 2 is positioned to contact the acidic residues (Aspartic acid 112, Aspartic acid 113) located in the acidic groove.

FIG. 30 shows the separation of the suggested docking residues (Leu1, Lys2, Phe26, His27) on the primary sequence of Betacam. These residues combinations are distantly located on the primary sequence, but proximal to each other in space.

FIG. 31 shows the docking of two molecules of Betacam, engaged in homotypic, anti parallel interaction. The docking involves interactions between the two molecules on the exposed, non-glycosylated sides, and assumes opposing dipole interactions, which buries the docking residues in the acidic groove region. The resulting dimer is offset by one-half Ig-domain. The consequence of this docking would be to create a dimer, in which two molecules of Betacam, attached to the cell surface of adjacent beta cells become closely connected.

FIG. 32 shows the docking of 4 molecules of Betacam, using D1/D2 interactions, and D3-D3 interactions. The model assumes D3-D3 homotypic interactions using the non-glycosylated face of the D3 domain.

Referring now to the invention in more detail, if one considers the capacity of D3-D3 homotypic interactions, the molecule will be capable of forming a multimeric structure between two Betacam-expressing cells. The iteration of the model described in FIG. 33 can be extended in either direction through the addition of Betacam molecules, which will extend the free ends of the D1/D2 domains. Under those circumstances, a multimeric Betacam structure forms with the shape of a spiral, the apexes of which will be connected by two interacting D3-domain at the surface of the cell. It is believed that such a structure allows for strong homotypic cell-cell interaction, and the formation of a tight junction type complex between interacting cells. The model therefore posits that Betacam is a tight junction molecule, expressed by certain cells of both higher and lower vertebrates.

In a comparative perspective related to a predicted physiological involvement of Betacam, FIG. 34 displays the threading results of the Drosophila melanogaster Lachesin protein GeneID: 36363, Flybase link FLYBASE:FBgn0010238 against mammalian NCAM. The prediction score is 100%. Lachesin is the closest homologue to Betacam in flies. Lachesin (Lac), a cell surface protein, is required for the proper morphogenesis of the Drosophila tracheal system. Data suggest that Lac regulates organ size by influencing cell length rather than cell number, and cell detachments, indicating a role for Lac in cell adhesion. Results from an in vitro assay further support that Lac behaves as a homophilic cell adhesion molecule. Lac co-localizes with Septate Junction (SJ) proteins, and ultrastructural analysis confirms that it accumulates specifically at this type of cellular junction (Development. 2004 January; 131(1):181-90). Although not described in detail in mammals, in the mammalian peripheral nervous system, nerve insulation depends on the integrity of paranodal junctions between axons and their ensheathing glia. Ultrastructurally, these junctions are similar to the septate junctions (SJ) of invertebrates. In Drosophila, SJ are found in epithelia and in the glia that form the blood-brain barrier (BBB). It has been described that Drosophila Lachesin (Lac), which is a SJ component, is required for a functional BBB in that organism. It is generally recognized that tight junctions regulate the barrier to paracellular permeability in chordate epithelia, including mammals; examples of another non-vertebrate organism such as the sea urchin, it has been shown that at its blastula stage, its epithelium lacks tight junctions and instead possesses septate junctions. Septate junctions are unique to non-chordate invertebrate cell layers and have a characteristic ladder-like appearance whereby adjacent cells are connected by septa. The similarity between Lachesin and Betacam, indicates that Betacam may be involved in formation of a septate-like adherens complex between pancreatic beta cells, which may help in forming proper beta-cell structure.

Again, in a comparative perspective to the invention related to a predicted physiological involvement of Betacam, FIG. 35 displays predicted membrane attachment of lachesin to the cell membrane using a GPI-anchor. As is shown in FIG. 36, lachesin does not contain a membrane spanning region. The entire molecule is present on the outside of the cell surface.

Again in a comparative perspective to the invention, a magnified view of lachesin is shown in FIG. 37, in which D1 has been enlarged Annotation of amino acid sequence TDSTPVFLSTGST (SEQ ID NO: 53) present in Lachesin is shown. This linker corresponds to the D1-bulge residues in Betacam, and is here modeled on the surface of Lachesin opposite to that of the likely interaction side. It is noteworthy that the linker presence is a conserved feature between fly Lachesin and vertebrate Betacam.

Again in a comparative perspective to the invention, the charge-distribution of lachesin is shown in FIG. 38. Similar to Betacam, the D1-D2 region of lachesin displays a similar electric dipole as that observed in Betacam. An acidic groove is present. This is conferred by Asp41 and Asp102. Lys56 is localized in position close to that represented by Betacam Lys2, and the N-terminal end of the molecule. Consequently, a model of lachesin homotypic interactions can be predicted following the exact geometry of that proposed for Betacam, involving a basic-residue docking into an acidic groove present at the D1-D2 link region.

Again, in a comparative perspective to the invention related to a predicted physiological involvement of Betacam, FIG. 39 displays the detection of a signal peptide in lachesin with high confidence.

Example 4 Generation of Antibodies Reacting to Betacam Peptides and Detection of Betacam Expressing Cells

Referring now to the invention in more detail, FIG. 40 and FIG. 41 display a schematic outline of various molecular forms of Betacam, designed and created by the inventors, and used in the following to functionally define the properties of the Betacam protein in relation to the specific claims of the invention. The generation of the various forms of Betacam were done by molecular biology techniques involving cloning DNA fragments into bacterially-propagated vectors, also known as plasmids. The vectors used for these particular purposes are commercially-available products, not uniquely designed for the functional analysis of Betacam, and derived polypeptides. More specifically, the vectors employed took advantage of multiple favorable features for analyzing membrane-type components.

The following plasmid backbones were applied to perform various functional assessments of Betacam. FIG. 42 is a vector map for pCMV-SPORT6. This vector allows for eukaryotic expression. FIG. 43 is a vector map for pCDNA13.1D/V5-His-TOPO, this vector allows for generation of V5- and His-tagged versions of Betacam, which then can be tracked by virtue of the added fusion protein tags. FIG. 44 is a vector map of pIRES2-EGFP. This vector provides a non-fused IRES-driven EGFP reporter that allows non-invasive tracking of expressing cells. It also allows for eukaryotic expression. FIG. 45 is a vector map of pIRES2-DsRED2. This vector provides a non-fused IRES-driven DsRed red fluorescent reporter that allows non-invasive tracking of expressing cells. It also allows for eukaryotic expression. FIG. 46 is a vector map of pFUSE-hIgG1-Fc2. This vector allows for the construction of Fc-fusion proteins. Such proteins are secreted due to the N-terminal presence of a signal peptide derived from the Interleukin2 protein. It allows for eukaryotic expression based on the presence of a hEF1/HTLV promoter. FIG. 47 is a map of pGEX-4T3. This plasmid allows for the generation of GST-fusion proteins, which can be purified by glutathione-conjugated column material. The vector is optimized for bacterial expression. The inventors inserted various defined nucleotide sequences from human Betacam into the above vectors as described in the following examples.

Referring now to the invention in more detail, FIG. 48 illustrates the production of an N-terminal fragment of Betacam as a GST-fusion protein (GST-Betacam33-80) in bacteria. The GST portion (Glutathione-S-transferase) is capable of binding Glutathione, hereby facilitating detection and purification of the fusion protein using commercial-type products. In the particular example, GST-Betacam33-80 was expressed in bacteria, using the pGEX-4T3 vector, and production was induced by IPTG. Protein samples from the lysed bacteria were analyzed using denaturing SDS-PAGE gel analysis, stained using a coomassie-based blue stain of the resulting gel. The uninduced bacteria (U) showed a background level of bacterial protein bands. Following induction (I), a prominent band at 30 kDa appeared, which corresponds to GST-Betacam33-80. This lysate was incubated with Glutathione-conjugated sepharose beads, which allowed binding of the GST-Betacam33-80. The fusion protein was excluded from the supernatant (SB), and detected with high specificity on the bead fraction (B). Incubation of the Bead fraction with free Glutathione released a minor fraction of the bead-bound GST-Betacam33-80. Three eluates revealed a similar results (eluate 1, 2, 3). The majority of GST-Betacam33-80 remained bound to the Bead fraction, and was deemed relatively insoluble. Given that purity was achieved, the Bead/GST-Betacam33-80 fraction was used to immunize rabbits against GST-Betacam33-80.

Referring now to the invention in more detail, FIG. 49 describes results using combinations of antisera from immunized rabbits. Hi-C and Bun-E are two independent rabbits immunized with GST-Znt8, another molecule unrelated to GST-Betacam33-80. RAT and CAT are two independent rabbits immunized with GST-Betacam33-80. “Randy's prep” is a third GST-fusion protein preparation. Pre-immune bleeds are the RAT and CAT animal sera, obtained prior to immunization. They were expected not to be able to react. The inventors observed that Hi-C and Bun-E rabbits react to GST-type protein of both GST-Znt8 and GST-Betacam forms. Similarly, rabbits CAT and RAT reacted to both GST-Znt8 and GST-Betacam forms. This was due to the presence of the GST moiety in both fusion proteins. This was validated by the reactivity of either rabbit to GST-only, last lane.

Referring now to the invention in more detail, FIG. 50 describes results using individual antisera obtained from the rabbits “CAT” (C6743) and “RAT” (R8252), analysed using ELISA for reactivity and binding to GST-Betacam33-80. In this assay, “RAT” antisera reacted with increasing titers upon later bleeds, whereas rabbit “CAT” did not react.

Referring now to the invention in more detail, FIG. 51 is a western blot showing that anti-betacam antibodies raised against GST-Betacam33-80 detected Fc-D1 and FcD1/D2, but not Fc forms of Betacam lacking D1. This confirmed that the antibodies raised reacted against D1. The construction and production of Fc-betacam proteins are described in Example 5.

Referring now to the invention in more detail, FIG. 52 describes results using the “RAT” antisera as primary detection antibody for Betacam, on frozen sections obtained from CD1 mouse pancreas, or embryonic tissue. The analysis was performed as a co-staining of Betacam with Insulin. Embryonic expression of Betacam was detected at E14.5 in individually scattered cells, also expressing insulin (top row). At the later embryonic stage, E18.5, at which pancreatic beta cells have aggregated, expression of Betacam was detected only in the insulin-producing cells (middle row). There was a general congruency between Betacam and insulin at this stage. In the adult islet (2 months old mice), Betacam was detected with complete congruency to insulin-producing beta cells (lower row). All pancreatic insulin-producing cells reacted with the anti-GST-Betacam33-80 antibody; all anti-GST-Betacam33-80 reacting cells also expressed insulin.

Example 5 Creation and Detection of Fc-Betacam Fusion Proteins

Construction details for the generation of Fc-fusion construct plasmids is outlined in the following. Various Betacam domains (e. g. D1, D2, D3, D1/D2, D2/D3) were amplified by using appropriate primer pairs. The amplified fragments were purified and digested by using the corresponding restriction enzyme sites, and ligated into the vector pFUSE-hIgG1-Fc2. The primer pairs used for amplification are as following:

NcoI (SEQ ID NO: 54) D1 sense: AATTCCATGGCTCTGAAGGTGACCGTGCCGTC BglII (SEQ ID NO: 55) D1 antisense: CACCAGATCTAGGATCATCGACAGTGACTT NcoI (SEQ ID NO: 56) D2 sense: AATTCCATGGCTCCTGTCATGAAGCCAATGGT BglII (SEQ ID NO: 57) D2 antisense: CACCAGATCTATATATGGTGGGCATAATGA EcoRV (SEQ ID NO: 58) D3 sense: GGCCGATATCTTATGGACCTTATGGACTTCA NcoI (SEQ ID NO: 59) D3 antisense: GACGCCATGGCTATGATGACTGTGAATCGAG NcoI (SEQ ID NO: 60) D1/D2 sense: AATTCCATGGCTCTGAAGGTGACCGTGCCGTC BglII (SEQ ID NO: 61) D1/D2 antisense: CACCAGATCTATATATGGTGGGCATAATGA StuI (SEQ ID NO: 62) D2/D3 sense: GGCCAGGCCTTCCTGTCATGAAGCCAATGGT NcoI (SEQ ID NO: 63) D2/D3 antisense: GACGCCATGGCTATGATGACTGTGAATCGAG

The following describes the methods applied by the inventors to prepare Fc-betacam fusion proteins using transient transfected Cos 7 cells. The calcium-phosphate transfection method was used for introduction of DNA into mammalian cells and was based on the formation of a calcium phosphate-DNA precipitate. The calcium phosphate facilitates the binding of the DNA to the cell surface. It is believed that the DNA then enters the cell by endocytosis. The procedure is routinely used to transfect a wide variety of cell types for either transient expression or for the production of stable transformants. The DNA is mixed directly with a concentrated solution of CaCl₂. This is then added dropwise to a phosphate buffer to form a fine precipitate. Aeration of the phosphate buffer while adding the DNA-CaCl₂ solution helps to ensure that the precipitate which forms is as fine as possible. This is important because clumped DNA will not adhere to or enter the cell as efficiently. Generally, a final CaCl₂ concentration of 60 mM is used for calcium phosphate transfections. The final volume of DNA-CaCl₂ should not exceed 1/10th of the volume of media in which the cells are plated. Cells should be seeded at a density such that on the day of transfection they are no more than 50% confluent. The optimal seeding density produces a nearly confluent dish of cells when they are harvested or split into selective media 48 hours after the transfection. This will vary for each cell line and is dependent upon their doubling time. Generally, cells are seeded at a density of 5×10⁵/60 mm dish or 1-2×10⁶/100 mm dish. Between 10 and 100 μg of DNA may be transfected.

Calcium Phosphate Transfection Procedure was performed as follows. Cos 7 cells were prepared for Transfection, by plating cells in 100 mm or 60 mm dishes at the required density. Generally, cells were seeded at a density of 1-2×106/100 mm dish or 5×107/150 mm dish. Cells were incubated overnight at 37° C. in a humidified CO2 incubator. On the following day, transfection was performed by 3-4 hours prior to transfection, the media was changed. At time of transfection, a transfection mixture was added to cells.

Specifically, as an example: For a 100 mm dish containing 10 ml of media, a transfection mix was made as in the following: To a 1.5 ml tube, add 62 μl of 2M CaCl2, adjust total Volume to 500 μl with sterile ddH2O, and mix gently. Add 10 μg DNA (Fc-betacam fusion constructs of various types), mix gently. On vortex, using a pasteur pipette, slowly add 500 μl 2×HBSS drop-wise. The transfection mix was hereafter incubated at room temperature for 15-20 minutes.

Subsequently, the precipitate was added dropwise to the media to the cells in dish. Cells were next incubated overnight at 37° C. in a humidified CO2 incubator.

Two days following Conditioned Media (CM) was isolated from the tissue culture dishes.

Fresh medium was added in a few cases for further production of Fc-betacam fusion protein.

Fc-fused Betacam Purification using HiTrap Protein A HP columns, GE Healthcare was done as follows, according to manufacturer's instructions:

1. Prepare working buffers:

Buffer Stock solution ddH2O Final Volume Binding Buffer  5 ml  45 ml 50 ml Elution Buffer 0.5 ml 4.5 ml  5 ml

2. Fill up the 5 ml syringe with working Binding Buffer. Remove the stopper and connect the column to the syringe with the provided adaptor.

3. Wash the column with Binding Buffer at 1 ml/min. Wash two times. ‘drop to drop” to avoid introducing air into the column.

4. Apply the corresponding samples of Conditioned Fc-Betacam Cos 7 Media, using a new 5 ml syringe fitted to the adaptor by pumping it onto the column, at 1 ml/min.

5. Wash the column with Binding Buffer at 1 ml/min. Wash two times, until no material appears in the effluent.

6. Prepare a collection tube by adding 30 μl of Neutralizing Buffer. Elute with 400 ul of working Elution Buffer.

7. Re-load the flow-through onto the column for another elution, collect into the same collection tube.

Analysis of produced Betacam polypeptides by Western Blotting was done as follows. First, samples were prepared for SDS-PAGE analysis, by mixing 40 μl protein sample, 5 μl sample reducing reagent and 5 μl sample buffer (5× Sample Buffer: 10% w/v SDS, 10 mM beta-mercapto-ethanol, 20% w/v glycerol, 0.2 M Tris-HCl, pH 6.8, 0.05% w/v bromophenol blue). Samples were mixed gently, spun down, and subsequently boiled for 10 minutes to denature the protein. A cooling step on ice for 3-5 min was followed by spinning the sample down briefly. Samples were loaded onto NuPAGE® Novex® Bis-Tris Mini gels (Invitrogen). A protein molecular weight marker was added. Gel was run at 120 until loading dye began to exit gel. For Blotting, fiber pads, filter papers, and PVDF membrane were initially soaked in transfer buffer (25 mM Tris, 192 mM glycine, 20% methanol, pH8.3). Bubbles trapped in the filters or filter pads were removed. The PVDF membrane was activated in methanol prior to soaking in transfer buffer. The top and bottom of the gel were cut off. The gel was equilibrated in transfer buffer for a few minutes. Hereafter, the transfer cassette was assembled, and loaded in transfer apparatus. The transfer apparatus was filled with with transfer buffer prior to turning on the power supply. Proteins were transferred at 200 mA per transfer apparatus for 1 hr. After transfer, membranes were rinsed in ddH2O and placed in 5% milk solution (5% non-fat milk in 1×TBST) for 60 minutes at room temperature on a shaker to block non-specific binding. Following blocking, the membrane was soaked in primary antibody solution (example: goat anti-human IgG Fc specific, 1:500) on a shaker for overnight at 4° C. Hereafter the membrane was rinsed in ddH2O and washed 3 times for 10 min/wash. To detect the primary antibody binding, membranes were incubated in secondary antibody solution (donkey anti goat IgG-HRP, 1:5,000) for 60 min at room temperature on a shaker. Hereafter, the membrane was rinsed in ddH2O. Followed by three washes at 10 min/wash. Detection of signaling was performed using the ECL western blotting detection system (Amersham™ ECL™ Western Blotting Detection Reagents, GE Healthcare). 1 mL of solution A and 1 mL solution B was mixed. Membranes were incubated for various lengths of time, more preferably at 2 min. The reportable exposures differed, e.g. 30 sec, 1 min, 2 min, given optimal detection.

A result of production of Fc-Betacam proteins is shown in FIG. 53. By western blotting, using anti human-Fc, individual proteins of Fc-D1, FcD2, Fc-D3, Fc-D1/D2 and Fc-D2/D3 were detected under reducing conditions (left picture) and non-reducing conditions (no beta-mercaptoethanol, no boiling prior to gel loading). For all fusion proteins a single band of the expected size was observed (left most gel image). In all cases, these proteins migrated with a slower-forming complex under non-reducing conditions (right-most gel). This signified the presence of the disulfide bridges predicted to exist in both D1, D2 and D3. The inventors sought to purify a Fc-D1/D2/D3 fragment of Betacam as well. This was unsuccessful following multiple attempts. As shown in subsequent examples the formation of intra-cellular aggrosomes was observed commonly during the expression of the full extracellular domain in eukaryotic cells. This problem was likely due to the formation of multi-protein aggregates in the endoplasmatic reticulum. This indicated that the protein may not be either produced, or secreted. To test for the latter possibility, the inventors performed a western blot of the COS7, Fc-expressing cells, in which protein lysates were obtained rather than secreted media. An image is shown in FIG. 54, revealing retention of various forms of Fc-Betacam in the COS7 cells. Most notably, retention is observed for D1/D2 and D2/D3.

Example 6 Binding of Betacam-Derived Polypeptides to the Cell Surface of Pancreatic Beta Cells

Bead adhesion and recruitment assay. To detect Betacam-derived polypeptides directly binding to the cell surface of pancreatic insulin producing cells, a single-step binding assay was developed, referred to as the recruitment assay. The assay takes advantage of pre-binding Fc-Betacam fusion proteins to fluorescent beads which have been pre-coated by protein A by the vendor (Bangs Laboratories, Inc.). The beads were subsequently washed and added to Betacam-expressing cells and control cells. Imaging was done to assess adhesion of the fluorescent beads to the cell surface of the cells after exhaustive washing.

A specific experiment was done as follows. First, beads were coated with Fc-Betacam fusion proteins. Twenty microliters of Protein A coated microspheres (Dragon Green dyed) were washed three times in 200 μl of 50 mM sodium borate (pH 8.2), resuspended in 50 μl borate buffer 0.2 M pH 8.5, and incubated with 20 μg goat anti-human Fc fragment specific antibody with gentle mixing overnight at 4° C. Beads were washed three times by centrifugation at 9,000 rpm for 10 min with borate buffer containing 0.3% immunoglobulin-free BSA (Sigma), and hereafter incubated with 50 μl of purified various Fe-fused Betacam versions (e.g. Fc-D1, Fc-D2, Fc-D3, Fc-D1/D2, Fc-D2/D3) during 3 hr at room temperature with gentle mixing. The concentration of the Fc-betacam fusion proteins ranged typically from (0.05-0.09 ug/ul). Beads were finally washed three times in 0.3% BSA borate buffer and resuspended in 100 μl of this buffer. Immediately before loading on the cells, any possible bead aggregates were disrupted with a 1 s ultrasound pulse using a probe sonicator (Vibracell, Bioblock). The microsphere adhesion and recruitment assay was done as follows. βTC6 cells were cultured in a 12-well plate, and 10 μl of bead solution was added per well. After a 1 hr incubation time, three washes with PBS-BSA 0.3% were performed. Hereafter, the cells were photographed under the fluorescence microscopy.

FIG. 55 describes results using various Fc-conjugated Betacam versions, bound to fluorescent beads, which were added to the media of growing bTC (insulinoma) cells, followed by repeated washing, and imaging. Fc-only beads do not attach to the surface of bTC cells. However, Fc-D1, Fc-D2, Fc-D2, Fc-D1/D2, Fc-D2/D3 conjugated fluorescent beads attach effectively to the surface of pancreatic insulinoma cells.

Surface binding of Fc-Betacam to insulinoma cells. The inventors tested for direct binding of Fc-Betacam to the surface of pancreatic insulinoma cells, bTC6. The bTC6 cell line was found to express Betacam mRNA, as outlined in Example 2. The specific experimental conditions for the results shown in FIG. 57 were as follows. βTC6 cells grown as adherent cells were washed 3 times with 1×PBS, 5 min/wash. Hereafter, cells were incubated with 50 μl purified various Fc-fused Betacam versions (e.g. Fc-D1, Fc-D2, Fc-D3, Fc-D1/D2, Fc-D2/D3) and Fc (as control) (or 1:5 dilutions), 1 hr at RT. Cells were then washed 3× in 1×PBS for 5 min at RT. Next, cells were fixed with 4% PFA, 15 min at RT. Hereafter, a subsequent series of washes, 3× in 1×PBS for 5 min at RT, were performed to remove fixative. Primary antibody (goat anti-human IgG Fc-FITC, 1:100) were dispersed onto cells, and left O/N at room temp. Next morning, primary Ab were drained off, Cells were washed in 1×PBS, 3 times 5 min/wash, at RT. Following mounting, using cover slips in mounting media w/DAPI, imaging was performed. The inventors found that Fc-domain only did not bind to the surface of bTC6 cells whereas Fc-D1, Fc-D2, Fc-D3, Fc-D1/D2, Fc-D2/D3 were all capable of surface retention. This experiment was not capable of resolving affinity differences between the various forms tested.

Example 7 Definition of Homotypic Adhesion Properties of Betacam

Development and application of a fluorescent bead aggregation assay. Bead aggregation assays. The beads used in this assay were Protein A-conjugated polystyrene microspheres, fluorescent yellow-green (YG) (excitation maximum of 445 nm and emission maximum of 500 nm, the fluorescent emission is observed green) and fluorescent blue (excitation maximum of 475 nm and 600 nm & emission maximum of 663 nm, the fluorescent emission is observed red), 1.0 um (Polysciences, Inc.).

To coat fluorescent beads with Fc-Betacam proteins, the following procedure was applied by the inventors. 100 μl of Protein A-conjugated microspheres were added to a 1.5 ml microcentrifuge tube. The microspheres were washed once in sodium acetate 100 mM, pH3.9 (which is a pH at which any impurities coupled to protein A will be eluted) and twice in 10 mM Hepes, 50 mM NaCl, pH 7.2, by mixing the buffer, centrifuging in a micro-centrifuge for 5-6 min at 10,000×g, and then removing the supernatant. The Fc-Betacam proteins were bound to the beads at a ratio of 5 μg of protein per 40 μl of beads suspension in 10 mM Hepes, 50 mM NaCl, pH7.2, 1 mM CaCl2 for overnight at 4° C. on an Eppendorf shaker (1,400 rpm). Hereafter, the coated beads were pelleted, washed twice, and resuspended in 100 μl of 10 mM Hepes, 50 mM NaCl, pH 7.2. The suspension was briefly (˜30 sec) sonicated to obtain single beads, as determined by microscopy. Pictures were obtained recording the dissociation. The amount of protein coupled to the beads was determined by taking an aliquot and pelleting it and resuspending in 2×SDS sample buffer, boiling for 10 min. The beads were subsequently pelleted, and the supernatant was immunoblotted with the anti-human IgG (Fc specific)-HRP conjugate after SDS-PAGE.

An assay for establishing domain-specific interactions of Betacam-derived polypeptides was developed based on flow automated cell/bead sorting analysis. The detection of fluorescent beads allowed integer bead count assessment using fluorescence intensity as measured through FACS analysis. Such integer measurements therefore allowed the detection of bead aggregation as it may be mediated through conjugation of specific protein interaction domains. Beads were conjugated O/N. Prior to aggregation, sonication of the individual beads/mixtures was performed to achieve initial disaggregation. Hereafter, beads were incubated for 90 minutes, and subsequently analyzed by FACS analysis for multimer-formation. Following the FACS analysis, a spread of the bead mixture was performed on a microscope slide, and digital imaging using epifluorescence microscopy was done to assess visually the presence/absence of bead aggregates.

For the aggregation assay, Blue beads and YG beads were then mixed (equal volume of each) appropriately for a final volume of 100 μl and 1 mM CaCl₂ was added to initiate aggregation. The samples were incubated at room temperature on an Eppendorf shaker (1,400 rpm), and at various time points, 10 μl aliquots were removed and diluted 30-fold with 10 mM Hepes, 50 mM NaCl, pH7.2, 1 mM CaCl₂, and analyzed with the BD LSR II flow cytometer, or under the fluorescent microscopy for image acquisition. FIG. 57 shows images of various combinations of fluorescent beads. In the following, these images were compared to quantitative assessment of protein interactions based on flow cytometry.

The extent of aggregation, as determined by the aggregate size and aggregate composition, was quantified with a BD LSR II flow Cytometer. A 2D density plot of the intensity of red fluorescence versus green fluorescence in each aggregate revealed the size distribution and composition of the aggregates. The percentage of aggregates containing more than one red or green bead indicated the propensity for heterophilic binding.

The inventors first tested for intrinsic abilities of the fluorescent beads to bind as homodimers, or heterodimers. YG-beads (green fluorescent) showed no intrinsic capacity for aggregation (FIG. 58). Blue beads (red fluorescent) showed no intrinsic capacity for aggregation (FIG. 59). Mixing YG and Blue beads showed no intrinsic capacity for heterotypic aggregates (FIG. 60). Conjugation of Fc-only to the YG-bead fraction did not allow for aggregation (FIG. 61). Conjugation of Fc-only to the Blue bead fraction did not allow for aggregation (FIG. 62). Mixing Fc-conjugated YG- and Blue-type beads did not lead to aggregation (FIG. 63). Testing for homodimer formation between individual Betacam immunoglobulin domains was next performed. D1-YG conjugated beads revealed a very weak interaction suggesting D1-D1 homodimers may form (FIG. 64). Similarly, D2-D2 weak interactions were detected (FIG. 65). In contrast, D3-D3 homodimers are prominent (FIG. 66), and exist even prior to sonication, suggesting intrinsic capacity of D3-conjugated beads to aggregate in a homotypic fashion. A second experiment was performed at which the sonication process was extended, reducing the presence of pre-formed D3-D3 homodimers. It was observed that the 90 minute aggregation period leads to a dramatic increase in numbers of D3-D3 homotypic aggregates (FIG. 67). Testing D2/D3 in a similar manner yields similar results as those observed for the D3-D3 interactions, arguing that homotypic interactions between D2/D3 is mediated through the D3 domain, and that D2 contributes little, if any, to the observed aggregation (FIG. 68). D1/D2 homotypic aggregates were similarly analysed, and the inventors here observed a dramatic formation of very large aggregates, following incubation. This interaction appears stronger than the observed D3-D3 interaction. Aggregates of >100 beads were observed during immunofluorescence detection (FIG. 69). The inventors next tested for heterodimeric binding of the various Immunoglobulin domins from Betacam. In no case did the inventors observe strongly forming heterotypic aggregates between D1 to D2 or D3, or D2 to D3 (FIG. 70). A control experiment in which Fc-conjugated blue beads were tested for binding to YG-beads conjugated to D1, D2 or D3 revealed no heterotypic interactions forming, providing evidence that the observed interactions are mediated by the D1/D2 and D3 domains (FIG. 71). The inventors concluded that the extracellular domains of Betacam exhibit two individual models in homotypic interactions, one involving the D1/D2 fragment, which is explained by antiparallel interactions, which are stronger than any D1-D2 interactions individually analyzed. The other interaction mode is based on D3-D3 homotypic interactions, which do not interfere with D1/D2 homotypic interactions.

The combined evidence of these interactions is strongly supportive of the proposed theoretical models of Betacam higher-order interactions as outlined in Example 2.

Example 8 Generation of Mammalian Cells Stably Expressing Betacam, and Analysis of Cell Surface Expression of Betacam

Referring now to the invention in more detail, FIG. 72 describes results from production of stably expressing cell lines for a Betacam-DsRED fusion protein. CHO

(Chinese hamster ovary) cells were transfected with pCMV-DsRED or pCMV-Betacam-DsRED. Selection of clones stably expressing the constructs was performed using G418 (neomycin). It was noted that loss of Betacam-DsRed colonies during the culture phase was prominent. Clonal homogeneity is apparent in both cases. DsRED distributes throughout the cell, whereas Betacam-DsRED becomes localized to strongly fluorescent aggregrates, predominantly in a perinuclear location. Such localization is commonly observed if proteins form intracellular aggregates, and generate “aggrosomes”, which a eukaryotic version of bacterial inclusion bodies. The formation of aggrosomes is a result of ER-stress. Often, apoptotic death is associated with this, and the cells are impaired in growth. The inventors subsequently produced stable clones expressing N-terminally truncated versions of Betacam-DsREd fusions. CHO-cells expressing Betacam-DsRed-ΔD1 and Betacam-DsRed-ΔD1/D2 were similarly selected by G418 treatment. Such cells revealed that Aggrosome formation was observed in cells expressing Betacam-DsRed-ΔD1, but not in cells expressing Betacam-DsRed-ΔD1/D2. The Betacam-DsRed-ΔD1/D2 fusion protein distributes over the entire cell, and seems incapable of aggregation in this assay.

Referring now to the invention in more detail, FIG. 73 describes results transiently overexpressing a V5-tagged full length Betacam construct in Att20 (mouse anterior pituitary tumor) cells. The motivation for selecting the pituitary cell line was based on the fact that Att20 cells are of neuroendocrine type, and producing adrenocorticotropic hormone (ACTH), and consequently more resembling pancreatic b-cells as compared to CHO cells. A shorter detection tag (V5 epitope) was selected considering is smaller size, and lower likelihood of interfering with normal Betacam functions. Detection of the fusion protein was facilitated by using a monoclonal antibody binding to the V5 epitope. The full length Betacam construct contained the intact pre-peptide of Betacam. Two cells adjacent to, and contacting, each other, both expressing the V5-tagged Betacam protein are shown. The inventors note that formation of aggrosomes is minimal. The cell-cell connection interface between the cells is enriched for V5-reactivity. The inventors conclude that Betacam is expressed on the cell surface of either cell, and enriched at cell/cell junctions.

Referring now to the invention in more detail, FIG. 74 describes results transiently overexpressing a V5-tagged full length Betacam construct in Chinese hamster ovary cells (CHO). Here, a confocal image scan was obtained better capable of resolving the interaction interface between two Betacam-expressing cells.

Referring now to the invention in more detail, FIGS. 75 and 76 describes results stably overexpressing a V5-tagged full length Betacam construct in both CHO and Att20 cells. A Western blot, using the V5 reacting monoclonal antibody, secondarily detected using HRP-conjugated anti-mouse antibodies is shown. Protein lysates from various independently selected stable clones of either cell line was loaded. Expression of Betacam varied, as expected, between individual clones. Results from the CHO expressing lines were not referred to in more detail. Att20 clones expressing Betacam-V5 were those named 11F and 12E, where 12E expressed lower total levels than 11F. Several G418 resistant clones did not express Betacam-V5 at detectable level (Clone 11A, 11C, 11F, 12B, 12F). Clone 12E was selected for further analysis.

Referring now to the invention in more detail, imaging of Att-20 clone 12E was performed using the V5 added tag to Betacam. Left image in FIG. 77 revealed widespread presence of immunofluorescence. This was most notable at cell/cell boundaries. The non-transfected control population (WT, rightmost) did not show a similar staining.

Another imaging result of Att20 clone 12E is shown in FIG. 78. This was represented in black/white to better reveal contrast. Brightly fluorescent specks were detected at cellular membrane/membrane connection points (both images of are of the Att20 transfected population, they represent different fields-of-view).

FIG. 79 shows FACS analysis of Betacam in CHO cells.

Example 9 Cellular Aggregation Properties of Betacam

HEK 293FT cells were grown on culture dishes and washed three times with phosphate-buffered saline (PBS) without Mg²⁺ or Ca²⁺, and collected after brief exposure (1 min at 37° C.) to trypsin-EDTA (Sigma). After centrifugation, a single-cell suspension was obtained by passing cells through a cell strainer (40-μm pore size). The cells were suspended into DMEM medium containing 10% FCS at a density of 1×10⁶ cells/ml. The single-cell suspension was placed in 24-well plates pre-coated with 2% bovine serum albumin (BSA) and then rotated on a shaker at 37C for indicated periods (1 hr, 2 hr, 5 hr, overnight).

In a selected experiment, cells were suspended in normal Hanks' balanced salt solution (HBSS) containing Ca²⁺ and Mg²⁺ (Sigma), or Ca²⁺- and Mg2+-free HBSS (Sigma) containing 2 mM EDTA. Both HBSS solutions were supplemented with bovine serum albumin (BSA) at a final concentration of 2%. FIG. 80 shows a result in which pBETACAM-EGFP expression led to cell aggregation, detectable at 5 h, whereas transfection with pEGFP resulted in no difference in aggregation properties.

In a selected experiment, two individual labeled Betacam forms (pBetacam-EGFP and pBetacam-DsRed) were transfected into HEK293FT cells; control transfections were performed using pEGFP-N1 and pDsRed-N1. The individual cells were subjected to the aggregation protocol as described earlier. FIG. 81 shows the aggregation of both red, and green, labeled Betacam-expressing cells. No aggregation was observed using the EGFP and DsRED proteins.

Referring now to the invention related to embodiments based on the Betacam peptide, an alternative method could be considered if a similar, but different gene as compared to Betacam exists which creates, upon its transcription and translation of its mRNA, a surface epitope present with high selectivity on the beta-cell surface. However, the inventors have reason to claim such a molecule is unlikely to exist. These claims are based on the following observations. 1. The genomics-based bioinformatics analysis was performed using the MOE-430v2 Affymetrix platform, which covers >90% of the transcribed genome (>40000 features are present on the platform). 2. The definition of tissue-specific expression towards pancreatic b-cells, also having selective expression of genes within the endocrine compartment of the pancreas was performed to lead to a much-reduced subset of genes (<200). 3. Within this list, all genes were manually curated for membrane localization, presumed function and novelty. 4. Only a single gene emerged, Betacam, consequently being the one exhibiting the highest degree of tissue-specificity for the pancreatic islet, beta-cells in particular, also being expressed on the surface of such cells, and representing a novel, uncharacterized gene. These criteria were fulfilled for both the mouse, and human Betacam genes and their translated products.

The embodiments of the invention are related to the utilization of Betacam as a novel, surface-expressed epitope, which normal pancreatic beta cells have. When combined with the predicted function of Betacam as a novel homotypic cell adhesion molecule, the various embodiments described are reagent definitions and formulations based on the advantageous features that such a unique component present on such cells provide.

The advantages of the present invention include, without limitation, the formulation of a reagent set capable of tracking a normal, as well as dysfunctional beta cells under several varying conditions. The advantage of the present invention is provided by our findings that a similar molecule is not known to exist. Our bioinformatics analysis suggests that an alternative molecule is unlikely to exist.

The advantages of the present invention lie in the recognition that there is no method to detect islet cells following transplantation in-vivo in a non-invasive manner.

The advantages of the present invention lies in the recognition that there is no specific purification step of the pancreatic beta cell type included current islet transplantation, as this is based on gradient purification of total islets, and results in quite crude cell populations.

The advantages of the present invention lie in the recognition that there is no non-invasive method to purify pancreatic beta cells from e.g. forward-differentiated human ES cells.

hypothetical protein LOC253012 isoform 1 [Homo sapiens] (Human). RED SEQUENCE CORRESPOND TO Predicted pre-peptide. SEQUENCE ID: 1

hypothetical protein LOC253012 isoform 2 [Homo sapiens] (human) RED SEQUENCE CORRESPOND TO Predicted pre-peptide. SEQUENCE ID: 2

hypothetical protein LOC101202 [Mus musculus] RED SEQUENCE CORRESPOND TO Predicted pre-peptide. SEQUENCE ID: 3

hypothetical protein LOC296846 [Rattus norvegicus] (rat) RED SEQUENCE CORRESPOND TO Predicted pre-peptide. SEQUENCE ID: 4

hypothetical protein LOC513430 [Bos taurus] .(Cow) RED SEQUENCE CORRESPOND TO Predicted pre-peptide SEQUENCE ID: 5

similar to CG12369-PA, isoform A [Canis familiaris] (Dog) RED SEQUENCE CORRESPOND TO Predicted pre-peptide SEQUENCE ID: 6

PREDICTED: hypothetical protein [Danio rerio], Zebrafish RED SEQUENCE CORRESPOND TO Predicted pre-peptide SEQUENCE ID: 7

PREDICTED: hypothetical protein [Equus caballus] (Horse) RED SEQUENCE CORRESPOND TO Predicted pre-peptide SEQUENCE ID: 8

PREDICTED: hypothetical protein [Gallus gallus] (chicken) RED SEQUENCE CORRESPOND TO Predicted pre-peptide SEQUENCE ID: 9

PREDICTED: hypothetical protein [Pan troglodytes] (monkey) RED SEQUENCE CORRESPOND TO n-TERMINAL EXTENSION NOT CONTAINING A PRE-PEPTIDE SEQUENCE ID: 10

PREDICTED: hypothetical protein [Macaca mulatta] (Monkey) RED SEQUENCE CORRESPOND TO n-TERMINAL EXTENSION NOT CONTAINING A PRE-PEPTIDE SEQUENCE ID: 11

unnamed protein product [Tetraodon nigroviridis] (pufferfish) SEQUENCE ID: 12

Lachesin, Drosophila melanogaster (fruit fly) SEQUENCE ID: 13

Extracellular domain of Betacam [Mus Musculus] (Mouse) SEQUENCE ID: 14

Extracellular domain of Betacam [Homo sapiens] (Human) SEQUENCE ID: 15

Betacam-DsRed2 [Mouse] as in pIRES2-DsRED2 BLUE SEQUENCE: Betacam portion BLACK SEQUENCE: linker peptide RED SEQUENCE: DsRED2 sequence SEQUENCE ID: 16

Betacam-nEGFP [Mouse] as in pIRES2-EGFP BLUE SEQUENCE: Betacam portion BLACK SEQUENCE: linker peptide RED SEQUENCE: EGFP sequence SEQUENCE ID: 17

pFUSE-FcOnly, BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide. SEQUENCE ID: 18

pFUSE-Fc-D1, BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 19

pFUSE-Fc-D2 BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 20

pFUSE-Fc-D3 BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 21

pFUSE-Fc-D1/D2 BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 22

pFUSE-Fc-D2/D3 BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 23

pFUSE-Fc-D1/D2/D3 BLUE SEQUENCE IS Glutathione S- transferase, Red sequence is IL2R signal peptide, GREEN SEQUENCE IS BETACAM DERIVED. SEQUENCE ID: 24

The teachings of all patents, published applications and references cited herein are incorporated by reference in their entirety.

While this invention has been particularly shown and described with references to example embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims. 

What is claimed is:
 1. An isolated nucleic acid that encodes an amino acid sequence of Betcam wherein the amino acid sequence consists essentially of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO: 2 or amino acids 30 through 463 of SEQ ID NO:
 3. 2. An isolated nucleic acid that encodes an amino acid sequence of an extracellular domain of Betacam, wherein the amino acid sequence consists essentially of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO:
 15. 3. An isolated nucleic acid consisting essentially of SEQ ID NO:
 26. 4. An isolated polypeptide that consists essentially of amino acids 31 through 462 of SEQ ID NO: 1, amino acids 19 through 450 of SEQ ID NO:2 or amino acids 30 through 463 of SEQ ID NO:
 3. 5. An isolated polypeptide that consists essentially of SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO:
 15. 6. An antibody that has binding specificity for the polypeptide of claim 4 or
 5. 7. A method of detecting beta cells in a mixture of pancreatic cells comprising detecting the presence of a polypeptide on the surface of the cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, and detection of expression of the polypeptide on the surface of the cells indicates that the cells are pancreatic beta cells.
 8. The method of claim 7 further comprising isolating the pancreatic beta cells from the mixture of cells.
 9. The method of claim 8 wherein the pancreatic beta cells are isolated from a biological sample.
 10. The method of claim 7 wherein the biological sample is pancreatic tissue.
 11. The method of claim 10 wherein the pancreatic tissue is obtained from a cadaver.
 12. The method of claim 7 wherein expression of the polypeptide is detected using an antibody that has binding affinity for the polypeptide.
 13. A method of detecting pancreatic beta cells in an individual in need thereof, comprising administering to the individual an agent that detects the presence of a polypeptide on the surface of the pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO:
 15. 14. The method of claim 13 wherein the individual is being screened for a risk of developing diabetes.
 15. The method of claim 13 wherein the individual has diabetes.
 16. The method of claim 15 wherein the diabetes is Type I diabetes or Type II diabetes.
 17. The method of claim 13 wherein the individual has had an islet cell transplantation.
 18. A method of isolating pancreatic beta cells from a mixture of pancreatic cells comprising: a) contacting the mixture with a reagent that specifically binds to a polypeptide present on the surface of pancreatic beta cells, wherein the polypeptide comprises SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15, thereby producing a combination; b) maintaining the combination under conditions in which the reagent binds to the polypeptide present on the surface of the pancreatic beta cells, thereby producing a complex of pancreatic beta cells bound to the reagent; and c) separating the complex from the combination, thereby isolating pancreatic beta cells from the mixture of pancreatic cells.
 19. The method of claim 18 further comprising d) separating the pancreatic beta cells from the reagent.
 20. The method of claim 18 wherein the mixture of pancreatic beta cells is pancreatic tissue.
 21. The method of claim 20 wherein the pancreatic tissue is obtained from a cadaver.
 22. The method of claim 18 wherein the reagent is an antibody that has binding affinity for the polypeptide.
 23. A method of identifying an agent that modulates the biological activity of betacam comprising: a) contacting a composition comprising a polypeptide, wherein the polypeptide has an amino acid sequence comprising SEQ ID NO: 14, SEQ ID NO: 15, an amino acid sequence that has at least 50% identity to SEQ ID NO: 14 or an amino acid sequence that has at least 50% identity to SEQ ID NO: 15 with an agent to be assessed; b) measuring the biological activity of the polypeptide in the presence of the agent compared to a suitable control, wherein if the polypeptide modulates the activity of the polypeptide in the presence of the agent compared to the control, then the agent modulates the biological activity of betacam.
 24. The method of claim 23 wherein the composition is one or more pancreatic beta cells.
 25. The method of claim 24 wherein the biological activity measured is homotypic cell adhesion between betacam-expressing pancreatic beta cells.
 26. The method of claim 24 wherein the control comprises pancreatic beta cells which have not been contacted with the agent to be assessed. 